ANTIBODY CONSTRUCTS WITH CDR SWITCHED VARIABLE REGIONS



Background of the Invention

This invention relates to antigen binding molecular
agents useful as diagnostic or imaging and
therapy agents. More specifically, the invention relates to
the preparation of antibody-derived proteins useful for
diagnosis of cancer, [...] lesions, infections, and other
pathological states. In [...] with reduced [...] which can
be efficiently expressed in eukaryotic cells.

Native antibodies are comprised of four protein
chains, two shorter "light" chains and two longer "heavy"
chains. The chains are associated in a specific three
dimensional structure. Each of the four chains consists of a
series of linked domain structures. These domains are
structurally related [...] the immunoglobulin [...].
[...] contains a variable domain, encoded by a [...] exon,
[...] of constant domains encoded by [...] exons, the number
being determined by whether the
chain is heavy or light and, for heavy chain, determined by



WO 96/06625 



PCT/US95/10791



the class of heavy chain. The number of heavy chain constant
domains is three for the most commonly occurring class of
immunoglobulins, IgG. The constant region of the light chain
consists of a single domain, C_L. When the variable domains
are properly folded, according to the dictates of the protein
sequence, the intact antibody provides a structure with
specific binding properties. If the intact antibody molecule
is envisioned as a Y-shape, the stem of the Y (Fc) is formed
by surface complementarity of the C_H2, hinge, and C_H3
portions of the constant regions of the two heavy chains,
which extend beyond the light chains. In addition, the two
heavy chains are covalently linked through a number of
disulfide linkages, the number of disulfide linkages varying
among the antibody classes (i.e., IgG, [...], IgE,
IgA) and subclasses (e.g., IgG1, IgG2, IgG3, IgG4). The
constant region of the gamma-1 heavy chain, for example,
includes three constant domains, C_H1, C_H2, and C_H3, with
C_H1 linked to C_H2 by an extended linker region called the
hinge. The five classes of antibodies are determined in the
main by their differing heavy chains - thus the IgA, IgD,
IgE, IgG and IgM classes have alpha, delta, epsilon, gamma
and mu type heavy chains, respectively. Each of these types
of heavy chain is characterized by having generally
conserved amino acid sequences in its constant domains and
hinge regions, regardless of the antigen to which it binds.
There are additionally two classes of light chains, lambda
and kappa, the latter being more abundant in many mammalian
species including mouse (ratio of kappa:lambda of 90:10) and
human (ratio of kappa:lambda of 60:40). As with the heavy
chains, each class of light chain has a generally conserved
constant domain sequence regardless of the antigen to which
the variable domain of the chain binds.

The variable domains are complementary, so that one
heavy and light chain pair joins to form each arm of the
antibody. Thus, the amino terminus of each arm contains a
region (Fv) containing the antigen binding variable domains
of one light and one heavy chain. Each variable domain






contains three complementarity determining regions (CDRs)
characterized by highly variable protein sequences between
different antibodies. Each CDR is framed by two of the four
framework regions (FRs) present in each variable region, thus
creating an alternating sequence of
FR-CDR-FR-CDR-FR-CDR-FR-(constant domain).
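The alternating FR-CDR-FR-CDR-FR-CDR-FR arrangement described above can be sketched as a small data structure. This is an illustrative model only: the class name `VariableDomain`, its fields, and the placeholder strings in the usage note are assumptions made for this sketch and are not taken from the specification.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class VariableDomain:
    """Hypothetical model of a variable domain: four FRs framing three CDRs."""

    frs: List[str]   # FR1..FR4, in N- to C-terminal order
    cdrs: List[str]  # CDR1..CDR3, in N- to C-terminal order

    def __post_init__(self) -> None:
        if len(self.frs) != 4 or len(self.cdrs) != 3:
            raise ValueError("expected 4 framework regions and 3 CDRs")

    def layout(self) -> List[str]:
        """Return the region labels in their alternating order."""
        labels: List[str] = []
        for i in range(3):
            labels += [f"FR{i + 1}", f"CDR{i + 1}"]
        labels.append("FR4")
        return labels

    def sequence(self) -> str:
        """Concatenate the regions in FR-CDR-FR-CDR-FR-CDR-FR order."""
        parts: List[str] = []
        for i in range(3):
            parts += [self.frs[i], self.cdrs[i]]
        parts.append(self.frs[3])
        return "".join(parts)
```

For example, `VariableDomain(frs=["AAA", "BBB", "CCC", "DDD"], cdrs=["x", "y", "z"])` uses placeholder strings rather than real residues; each CDR sits between two of the four FRs, as the text states.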

Antibody specificity and affinity are governed by
the sequence and structure of the CDRs. Outside of the CDRs
(i.e., within the FRs), variable domains of the light and
heavy chains have the same general structure, albeit with
noticeable and functionally significant differences in
sequence. The four FRs largely adopt a β-sheet conformation
and are joined by connecting loops which incorporate the
CDRs. The CDRs are held in close proximity by the FRs. Note
that it is not always necessary to have complementary pair
variable domains from one heavy and light chain to obtain
binding, as is found in native antibodies. Ward, et al.,
Nature 341:544-546 (1989), demonstrated that some V_H domains
by themselves have the capability of binding antigens.

[...] forms of antibodies and antibody fragments
are known for use in delivering drugs and toxins to specific
[...] body. Similarly, radiolabeled antibodies
and antibody constructs can be administered in vivo for
[...] imaging of tumors, thrombi, [...],
and other disease states. These immunotherapeutic and
imaging agents target a binding site on a particular tissue
or cell type, for example, a specific antigen associated with
a tumor or thrombus. As a result, other tissues or cells do
not accumulate the attached radioisotope, drug or toxin to
the same extent. Thus, the risk of toxicity to normal tissue
during systemic administration of drugs and radiolabels is
considerably lessened, and concomitantly the dose of the
therapeutic agent may be lowered.

Another approach in the case of antibodies for
[...] diagnosis is to use antigen binding fragments.
Antibody fragments display more rapid specific targeting,
less non-specific accumulation in the liver and spleen (due
to the absence of the Fc portion), and a faster rate of
clearance from the blood stream than intact antibodies. Due
to these characteristics, antibody fragments permit the use
of radioisotopes with short half lives, such as 99mTc,
Rh, and the like, as well as isotopes with longer half
lives such as 90Y and 111In.

The greatest amount of information to date has been
obtained with antibody fragments which have been produced by
enzymatic digestion of antibodies, with or without chemical
reduction. Digestion with papain cleaves the molecule above
the hinge region, containing the interchain disulfide bonds
linking the two heavy chains. The resultant fragments
include two identical Fab fragments, containing the heavy and
light chain variable domains, referred to generally by the
abbreviations V_H and V_L, respectively, the light chain
constant domain, C_L, and the first heavy chain constant
domain, C_H1, as well as a small portion of the hinge region.
When the intact antibody is digested instead with the
enzyme pepsin, cleavage occurs below the
disulfide bonds of the hinge region and results in a bivalent
molecule having the Fab regions from both arms linked by the
disulfides in a larger segment of the hinge than in the Fab.
The resulting fragment is called an F(ab')2 fragment. Upon
reduction of the disulfide bonds, the F(ab')2 fragment
[...]. The process of the cleavage
[...] results in [...] a significant loss of
binding properties. (See Wahl, et al., J. [...],
21:317-325, 1983). Therefore, the search continues for
targeting molecules having specificity, enhanced binding
activity, minimal non-specific binding, and a shorter half-
life in vivo than intact antibodies. This is especially true
for in vivo diagnostic (imaging) applications.

While antibody fragments have advantages for many
applications, the intact antibody has advantages for many
therapeutic approaches. Naked antibody therapy,
utilizing antibody molecules which are not coupled to drugs,
radioisotopes, or toxins, often requires effector functions
located in the Fc portion for action. This Fc portion is
absent from most fragments. In addition, radioimmunotherapy
may be more effective with intact molecules as the total dose
delivered is a function of residence time at the tumor, which
is uniformly higher for intact antibody molecules over
fragments due to the same factors that cause fragments to be
more rapidly cleared from the blood stream.

It is also possible to directly express
immunoglobulin deletion mutants such as Fab or F(ab')2-like
fragments, using recombinant DNA techniques. In one such
procedure an Fd' fragment (i.e., the portion of the
immunoglobulin heavy chain found in the Fab' molecule) was
expressed in E. coli (Cabilly, et al., Proc. Natl. Acad. Sci. USA
81:3273-3277 (1984)). Ward, et al., in "Binding Activities
of a Repertoire of Single Immunoglobulin Variable Domains
Secreted from Escherichia coli", Nature 341:544-546 (1989), also
describe expression of isolated heavy chain variable domain
genes from E. coli to form a type of binding fragment known
as a "single domain antibody." In another method described
by Gillies, PCT Patent Application No. PCT/US91/00633,
specific constant region domains of the human gamma heavy
chain, such as the C_H2 domain, were eliminated to [...]
the binding activity and eliminate effector functions (such
as complement activation and Fc receptor binding) of the
recombinant molecule over that of the native antibody.

Ideally, human antibodies and antibody fragments
would be used for immunotherapy and immunodiagnosis of humans
in order to avoid the undesired immune responses often caused
by administering non-human immunoglobulins to them. However,
human antibodies of appropriate specificity and affinity are
difficult to obtain. For instance, conventional hybridoma
techniques yield species hybrid cell lines that are
frequently unstable and often produce IgM antibodies, instead
of the more desirable IgG class of antibodies. An IgM
molecule is expressed primarily as a pentamer made up of five
identical subunits (IgM monomers), each containing two heavy
and two light chains. IgM monomers have, as a rule,
affinities that are too low for therapeutic and imaging
applications. Therefore, methods utilizing genetic
engineering have been developed for "humanizing" non-human
immunoglobulins. In the initial attempts, chimeric
antibodies were fashioned by replacing the entire variable
domain of human antibodies with those of another species
(usually murine). (See Morrison, et al., European Patent

Application No. EP 0 173 494 and PCT Patent Application No.
PCT/US91/01844.) However, many chimeric antibodies have
[...] immunogenic because they contain sufficient non-
human protein sequences to generate an immune response.
[...] towards "humanization," the CDRs
of human (acceptor) antibody species have been replaced by
those of another (donor) species, so that the framework
regions and the constant domains are entirely or
[...] human - only the CDR portion
of the antibody is non-human (see European Patent
Application Publication No. EP 0 239 400 by Winter, et al.).
[...], commonly [...] "CDR-grafted,"
can also be made as antibody fragments (see Winter, et al.,
[...]; Adair, et al., PCT Patent Application No. PCT/GB91/01108), or
[...] antibodies (U.S. Patent No. [...]
(8/7/91), issued to Ladner, et al., and U.S. Patent Nos.
5,132,405 (7/21/92) and 5,091,513 (2/25/92), issued to Huston,
[...]). [...] grafting of the donor CDR onto the
acceptor protein framework can displace the donor binding
[...] out of the optimal conformation and impair binding
[...]. [...] PCT
Patent Application No. PCT/GB90/02017, disclose a method for
[...] the CDRs [...] conformation by replacing
certain key amino acid residues in the acceptor antibody
framework regions to agree with those residues in
corresponding regions of the native donor antibody. This
process increases the binding efficiency of [...]
but at the same time can increase the immunogenicity of the
construct, since non-human residues are introduced into the
human part of the construct.

In addition to the above problems in humanizing a
non-human antibody, the process of producing vectors
containing genomic DNA for encoding humanized antibodies has
proven difficult due to the size of these human genes.

Accordingly, there exists in the art a need for
more and better genetically engineered antibodies with
lowered immunogenicity but with sufficient antigen-binding
affinity and specificity to be useful for in vivo detection
of disease, for therapy, and for a combination thereof, such
as for tumor imaging and cancer therapy. The need also
exists for recombinant antibodies that are easily expressed
in prokaryotic or eukaryotic host cells in commercially
useful quantities and which accumulate in normal tissue in
acceptably low amounts. Particularly of interest are
recombinant antibodies with reduced immunogenicity (for
instance a CDR-grafted antibody, and fragments thereof,
comprised of human framework regions and constant domains)
that bind quickly to their target sites and have other
preferred pharmacokinetic properties.

Since smaller forms of antibodies, such as
fragments, are less immunogenic than large intact antibodies,
the combination of CDR grafting with small molecular size
offers significant advantages for most [...] applications.
However, intact forms also have advantages in applications,
such as radioimmunotherapy, where long residence times at the
tumor are essential for maximum therapeutic effect.

Another approach to overcoming immunogenicity is 
the development of multiple reagents having common binding 
characteristics, but different structures. For example, use 
of different human frameworks with the same CDRs provides a 
different overall surface to the host immune system. More 
directly related to the current invention, use of frameworks 
from different human immunoglobulin chains provides unique 
molecular structures, either light chain CDRs with heavy 
chain frameworks or vice versa.

The multiple reagents described above can be used
in at least three ways. First, employing different molecular
forms in consecutive rounds of therapy can decrease the
likelihood of generating an immune response to any one form.
Similarly, administering a cocktail combining various forms
decreases the amount of any individual form administered,
again decreasing the likelihood of a specific immune
response. Finally, alternate molecular forms can be held in
reserve, to be administered after an immune response develops
to the first form administered.

There exists a need for recombinant antibodies with
increased specificity. These higher specificity antibodies
should be expressed from mammalian cells in order to have the
proper glycosylation, and should be expressed by the cells in
practical amounts. In order to impart desirable
pharmacokinetic properties, it is further desirable that the
recombinant antibodies be fragments of whole antibodies.
Finally, it is desirable that these recombinant antibodies be
as non-immunogenic as possible. This goal can be
accomplished by reducing the size of the construct, by
humanizing the construct to the extent possible, and by
replacing heavy chain framework regions with light chain
framework regions.

Many of the novel molecules embraced by the present 
invention provide multiple small, humanized forms, which are 
structurally distinct from native and other recombinant types 
of humanized antibodies and their fragments, but conserve 
affinity and specificity.

SUMMARY OF THE INVENTION

The present invention encompasses a recombinant
antibody or fragment thereof, and DNA and RNA sequences
therefor, comprised of at least one light chain variable
domain, which domain, in turn, comprises three CDRs wherein
one or more of the CDRs is derived from [identical to or
closely resemble(s)] the amino acid sequence of the
corresponding CDR(s) of a heavy chain variable domain of one
(donor) antibody and further comprises four framework regions
wherein the amino acid sequence of one or more of the framework
regions is derived from the amino acid sequence of the
corresponding framework region(s) from the light chain
variable domain of the same or a different (acceptor)
antibody, and pharmaceutical compositions containing such
antibodies or fragments.

The invention also encompasses DNA sequences 
encoding such recombinant antibodies or fragments thereof, 
and vectors containing these DNA sequences in addition to 
host cells transfected by these vectors. 

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 is a schematic representation depicting a
recombinant fragment defined herein as a CSV_L fragment. In
the example depicted, the CSV_L fragment is fused at the
[...] terminus to a peptide that chelates metal
ions. The illustrative CSV_L fragment also consists of all
four framework regions from the V_L domain of an acceptor
antibody and all three CDR regions from the V_H domain of a
donor antibody.

Figure 2 is a schematic representation depicting a
recombinant fragment defined herein as a Heavybody. The
Heavybody consists of a CSV_L fragment and a C_L domain.

Figure 3 is a schematic representation depicting a
recombinant fragment defined herein as a Kappabody fragment.
The Kappabody fragment has two chains: one a Heavybody and
the other a CDR-grafted light chain. Preferably, the two
chains are connected by a disulfide bond.

Figure 4 is a schematic representation depicting a
recombinant molecule defined herein as an intact Kappabody.
This molecule comprises two heavy chains, wherein both of the
heavy chain variable domains have been replaced by CSV_L
fragments, and two light chains, wherein both light chains
are CDR-grafted.

Figure 5 is a schematic representation depicting a
recombinant molecule defined herein as an ScFv(CSV_L) fragment.
As the title implies, the Figure depicts a CSV_L fragment
bound by a short peptide linker to a CDR-grafted V_L domain.

Figure 6 provides a linear array of the sequences
of light chain variable regions of eight antibodies whose
atomic coordinates have been deposited in the Brookhaven
Protein Data Bank (PDB). The identifiers used in this Figure
[...] Table 2. The sequences contained within bold boxes
represent consensus SCRs. The light boxes associated with
SCR5 enclose the SCRs common only to FB4 and each individual
sequence of the array. The NSCRs in each sequence are found
in the sequence segments outside of (and between, except for
NSCR [...] and NSCR 7.C) the bold boxes. [...] represent gaps
introduced into the sequences in order to align the columns
in the array.

Figure 7 provides a linear array of the sequences
of heavy chain variable regions of eight [...] from the
Brookhaven Data Base. The Brookhaven antibodies are referred
[...]. [...]
within bold boxes represent consensus SCRs. The light boxes
associated with SCR1 enclose the SCRs common to only FB4 and
each individual sequence of the array. The NSCRs in each
sequence are found in the sequence segments outside of (and
between, except for NSCR N.1 and NSCR 10.C) the bold boxes.
[...] introduced into the sequences in [...]
align the SCRs in the array.

Figure 8 provides the sequence array of the ZCE025
light chain variable region aligned with the Brookhaven
sequences shown in Figure 6. The segments of ZCE025
corresponding to the consensus SCRs are contained within bold
boxes. Kabat-defined CDR residues are in bold.
CDR-associated residues are in bold italics.

Figure 9 provides the sequence array of the ZCE025
heavy chain variable region aligned with the Brookhaven
sequences shown in Figure 7. The segments of ZCE025
corresponding to the consensus SCRs are contained within bold
boxes. Kabat-defined CDR residues are in bold.
CDR-associated residues are in bold italics.

Figure 10 provides a sequence array in which the
sequence of IM9 light chain variable region has been aligned
with the Brookhaven sequences shown in Figure 6. The IM9
segments corresponding to the consensus SCRs are contained
within bold boxes.

Figure 11 depicts a sequence array in which the
sequence of IM9 heavy chain variable region has been aligned
with the Brookhaven sequences shown in Figure 7. The IM9
segments corresponding to the consensus SCRs are contained
within bold boxes.

Figure 12 shows the variable region of the CSV_L(hb)
containing the light chain variable region of IM9 grafted
with the Kabat-defined CDRs from the heavy chain of ZCE025,
aligned with the heavy and light chain variable regions of
IM9 and ZCE025. Structurally homologous regions between
pairs of antibodies are enclosed by boxes.

Figure 13 shows the amino acid sequence of the IM9
light chain variable domain CDR-grafted with CDRs derived
from the heavy chain of ZCE025. Lower case letters represent
residues from the IM9 human V_K domain; upper case letters
represent residues from the ZCE025 murine V_H domain; e represents
a glycosylation site; * designates CDR-supporting framework
residues from the donor antibody; $ designates residues
involved in domain association and S designates residues that
are common to both the V_H domain of ZCE025 and the V_K domain
of IM9.

Figure 14 is a restriction map of the 9 Kb BamHI
fragment containing the IM9 kappa gene in bacteriophage
lambda EMBL3. The MboI termini generated by the partial
genomic digest were reconstructed as BamHI sites. The left
and right lambda arms are 20 and 9 Kb, respectively. The
exons are represented by solid boxes.

Figure 15 is a restriction map of pBluescript®KS-
(commercially available from Stratagene Cloning Systems,
11099 North Torrey Pines Road, La Jolla, CA 92037) containing
the IM9 kappa BamHI/BstEII insert from the 5'-end of the IM9
kappa gene subcloned from the 9 Kb BamHI fragment of Figure
14. The BstEII site was eliminated by filling in the 5'
overhang and cloning into the EcoRV site of pBluescript®KS-.
The exons are represented with solid boxes and the Ampr gene
is represented with a box.



Figure 16 is a map showing the primers for overlap
PCR mutagenesis of the IM9 kappa gene 5'-end from BamHI to
BstEII. The two sets of primers flanking the variable exon
specify the addition of SfiI sites on each side of the exon.
The location of the MstII site ablation is indicated 5' to
the open box representing signal exon I.

Figure 17 is a restriction map of the IM9 kappa
expression vector pGIM9kappa. Coding regions are represented
by stippled boxes with arrows indicating the direction of
transcription. In clockwise order from the ClaI site, the
vector consists of the following fragments: a ClaI - BamHI
fragment containing the ampicillin resistance gene, the SV40
promoter, the mycophenolic acid resistance gene, and the
SV40 polyadenylation site; and a BamHI - ClaI fragment
containing the IM9 kappa promoter, the IM9 kappa signal exon,
the IM9 kappa signal intron, the IM9 kappa variable exon, the
IM9 kappa major intron, including the kappa enhancer, the IM9
kappa constant region exon, and the IM9 kappa polyadenylation
site plus 3 Kb of downstream sequence.

Figure 18 shows a restriction map of the
pGIM9k/hzCE(CSV_L)-kappa expression vector. In clockwise
order 5' to 3' are: the BamHI to SfiI fragment containing the
IM9 light chain promoter and signal exon; the SfiI to SfiI
fragment containing the CSV_L exon and the 3' end of the major
intron; the SfiI to MstII fragment containing the remainder
of the major intron (including the IM9 light chain enhancer),
the IM9 C_K constant exon, and the IM9 kappa 3' untranslated
region; and the MstII to BamHI fragment containing the
pSV2gpt (enhancer minus) vector. The solid boxes with arrows
indicate open reading frames.

DETAILED DESCRIPTION OF THE INVENTION

The present invention embraces genetically
engineered CDR-grafted recombinant antibodies or antigen-
binding fragments comprised of at least one CDR switched
light chain variable domain (hereafter referred to as a
"CSV_L" fragment or domain), which domain, in turn, comprises
three CDRs wherein the amino acid sequence of one or more of
the CDRs is derived from the amino acid sequence of the
corresponding CDR(s) of a heavy chain variable domain of one
(donor) antibody and further comprises four framework regions
wherein one or more of the framework regions are derived from
the amino acid sequence of the corresponding framework
region(s) from the light chain variable domain of the same
or a different (acceptor) antibody. The recombinant
antibodies, and the corresponding antigen-binding fragments
thereof, will be referred to collectively herein as "CSV_L
recombinant antibodies". It will be understood by one
skilled in the art that the CSV_L recombinant antibodies can
contain CDRs and FRs from donor and acceptor antibodies of
widely divergent origins. Thus, the donor and acceptor
antibodies do not have to be from the same species, and
whether they are from the same species or not they certainly
do not have to be of the same class or subclass. Thus, one
could use a murine Ig-alpha donor antibody and a rabbit Ig-
gamma acceptor antibody to construct a CSV_L fragment of the
instant invention. Similarly, one could use a murine IgG-2a
donor antibody and a human IgG-4 acceptor antibody to
construct such a fragment.
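The CDR switch described above can be sketched in code as a splice of region strings: framework regions come from the acceptor light chain, and one or more CDRs come from the donor heavy chain. This is only a schematic of the sequence bookkeeping. The function name `graft_csv_l`, its parameters, and the placeholder strings in the usage note are assumptions for this sketch; it does not represent the cloning procedure or any residue-level adjustment.

```python
from typing import List, Sequence


def graft_csv_l(
    acceptor_vl_frs: List[str],
    acceptor_vl_cdrs: List[str],
    donor_vh_cdrs: List[str],
    grafted: Sequence[int] = (0, 1, 2),
) -> str:
    """Assemble a hypothetical CSV_L sequence.

    Framework regions FR1..FR4 come from the acceptor light chain.
    CDR positions listed in `grafted` (0-based) are taken from the donor
    heavy chain; any other CDR position keeps the acceptor light-chain CDR.
    """
    if len(acceptor_vl_frs) != 4:
        raise ValueError("expected 4 acceptor framework regions")
    if len(acceptor_vl_cdrs) != 3 or len(donor_vh_cdrs) != 3:
        raise ValueError("expected 3 CDRs from each antibody")
    if not grafted:
        raise ValueError("at least one CDR must come from the donor heavy chain")

    cdrs = [
        donor_vh_cdrs[i] if i in grafted else acceptor_vl_cdrs[i]
        for i in range(3)
    ]
    parts: List[str] = []
    for i in range(3):
        parts += [acceptor_vl_frs[i], cdrs[i]]
    parts.append(acceptor_vl_frs[3])
    return "".join(parts)
```

Called with placeholder regions such as `graft_csv_l(["fr1", "fr2", "fr3", "fr4"], ["l1", "l2", "l3"], ["H1", "H2", "H3"])`, the function interleaves acceptor light-chain frameworks with donor heavy-chain CDRs. Passing `grafted=(2,)` switches only the third CDR, reflecting the "one or more of the CDRs" wording.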

Five types of CSV_L recombinant antibodies comprise
the preferred embodiments of the present invention. The
first is the CSV_L fragment itself (see Figure 1); the second
is a single chain derivative termed a "heavybody" (see
Figure 2), which is composed of a CSV_L-containing fragment
fused through the C-terminus to the N-terminus of a light
chain constant domain. A third preferred embodiment is
termed a kappabody fragment, which comprises a heavybody
chain combined with a CDR-grafted light chain, preferably
covalently linked by a disulfide bridge between the two light
chain constant domains (see Figure 3). The latter light
chain differs in general from its CSV_L counterpart in that
the CDR-grafted chain has CDRs derived from a donor light
chain variable domain substituted for the native CDRs in the
acceptor light chain variable domain, versus substitution
with donor heavy chain CDRs in the case of a CSV_L domain. A
further preferred embodiment is termed an intact kappabody
(see Figure 4). The intact kappabody resembles an intact
CDR-grafted antibody (with all four variable domains having
at least one CDR replaced with a non-native CDR of the same
type of chain, i.e., heavy or light), differing in that the
two CDR-grafted heavy chain variable domains are replaced by
two CSV_L domains. The fifth preferred embodiment is termed a
single chain chain-switched variable fragment and is defined
as a CSV_L domain bonded to a CDR-grafted light chain variable
domain through a short peptide linker, generally no more
than 25 amino acid residues (see Figure 5). The symbol used
in the Specification for this embodiment is "ScFv(CSV_L)".
The C-terminal end of the CDR-grafted V_L domain can be fused
to the N-terminus of the CSV_L domain through the peptide
linker, or vice versa.
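As a reading aid, the five embodiments can be summarized as lists of chains, each chain being a list of domains in N- to C-terminal order. The dictionary `CONSTRUCTS`, the domain labels, and the helper `count_domain` are names assumed for this sketch. The heavy-chain constant region is shown as a single label because the number of constant domains depends on the heavy chain class.

```python
from typing import Dict, List

HEAVY_CONSTANT = "heavy-chain constant region"  # class-dependent; label only

# Each construct is a list of chains; each chain lists its domains in order.
CONSTRUCTS: Dict[str, List[List[str]]] = {
    "CSV_L fragment": [["CSV_L"]],
    "heavybody": [["CSV_L", "C_L"]],
    "kappabody fragment": [
        ["CSV_L", "C_L"],            # heavybody chain
        ["CDR-grafted V_L", "C_L"],  # CDR-grafted light chain
    ],
    "intact kappabody": [
        ["CSV_L", HEAVY_CONSTANT],
        ["CSV_L", HEAVY_CONSTANT],
        ["CDR-grafted V_L", "C_L"],
        ["CDR-grafted V_L", "C_L"],
    ],
    # One of the two orientations the text allows; the reverse is also permitted.
    "ScFv(CSV_L)": [["CDR-grafted V_L", "linker", "CSV_L"]],
}


def count_domain(construct: str, domain: str) -> int:
    """Count how many times a domain label occurs across a construct's chains."""
    return sum(chain.count(domain) for chain in CONSTRUCTS[construct])
```

In this summary every construct contains at least one CSV_L domain, and only the intact kappabody contains two.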

As with the CSV_L fragment, one skilled in the art
will realize that the heavybody, the kappabody fragment, the
intact kappabody and the ScFv(CSV_L) fragment offer a wide
array of choices for donor and acceptor antibodies. Thus,
taking the heavybody as an example, the donor antibody could
be a murine IgA1, while the framework region(s) and the C_L could be
from a sheep IgM acceptor antibody. Taking this principle
one step further, in the case of an intact kappabody, the
present invention contemplates the expression of a molecule
having one lambda and one kappa chain, regardless of whether
they were of the same species, or a molecule having two kappa
or two lambda chains of different species. To ensure proper
disulfide bridging, heavy chain acceptor antibodies of an
intact kappabody are preferably of the same species, class
and subclass.

The five illustrative generalized preferred
embodiments have several common, more preferred embodiments.
For instance, it is preferred that the donor and acceptor
antibodies for these five constructs be different and
be chosen from
murine, rabbit, or primate monoclonal or [...] antibodies.
Furthermore, it is preferred that all of the CDRs in the
various CSV_L and CDR-grafted V_L domains, as the case may be,
are identical in amino acid sequence to the corresponding
CDRs of the donor antibody; that all of the framework
regions are derived from the same amino acid sequence as
(i.e., being at least about 75% and preferably at least 85%
homologous to) the corresponding framework regions of the
acceptor antibody(ies); and that any constant domains,
whether light chain or heavy chain, as the case may be, are
identical in amino acid sequence to the corresponding domains
of the acceptor antibody(ies). In order to make these
preferred constructs less immunogenic, it is further
preferred that the acceptor antibody(ies) be human,
especially a human antibody that has light chains of the
kappa class, and more so when the human heavy chains are of
the gamma class. (It is understood that the class and
subclass of the two heavy chains in an intact kappabody are
preferably the same in order to obtain optimal disulfide
bridging between the two chains.) With regard to the
ScFv(CSV_L) fragment, when the acceptor antibody is human, it
is preferred that the linking peptide be from about 12 to
about 18 amino acid residues, and especially so when the
CDR-grafted V_L domain is fused to the N-terminus of the
polypeptide linker, and wherein the C-terminus of the
polypeptide linker is fused to the N-terminus of the CSV_L
domain.
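The framework preference stated above (at least about 75%, preferably at least 85%, homologous to the acceptor framework) can be illustrated with a simple check. The text does not say how homology is to be computed, so this sketch assumes plain position-by-position identity over two pre-aligned regions of equal length. The function names and tier labels are hypothetical.

```python
def percent_identity(region: str, reference: str) -> float:
    """Percent of positions with identical residues in two equal-length,
    pre-aligned sequences (an assumed stand-in for 'homology')."""
    if not region or len(region) != len(reference):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(region, reference))
    return 100.0 * matches / len(region)


def framework_tier(region: str, acceptor_region: str) -> str:
    """Classify a framework region against the thresholds quoted in the text."""
    identity = percent_identity(region, acceptor_region)
    if identity >= 85.0:
        return "meets preferred 85% level"
    if identity >= 75.0:
        return "meets 75% level"
    return "below 75% level"
```

With placeholder ten-residue strings, one mismatch gives 90% identity and two mismatches give 80%, which fall in the preferred and the 75% tiers respectively. A real comparison would first need an alignment that handles insertions and deletions, which this sketch leaves out.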

Further preferred embodiments of this invention
occur when murine monoclonal antibodies are used as donor
antibodies, and more so when these murine antibodies have
binding affinity for, and thus were raised against, tumor
antigens and antigens on thrombi; but especially so for tumor
antigens of human but also of any other vertebrate origin.
Preferred tumor antigens (or markers, as they are sometimes
called) are AFP, CA-125, CEA, Neuron Specific Enolase,
C-erb2/Her-2/NEU protein, Cathepsin D, Chromogranins A, B,
and C, the Cytokeratins, Epidermal Growth Factor Receptor,
Epithelial Membrane Antigen, Estrogen Receptor, Progesterone
Receptor, Prostatic Acid Phosphatase, Prostate Specific
Antigen, Ki-67, PGP-170 (a multiple drug resistance marker),
Proliferating Cell Nuclear Antigen, Vimentin, and the
proteins expressed by the c-myc, N-myc, N-ras, Ki-ras and
Ha-ras oncogenes. An especially important tumor antigen is
CEA, with preferred murine donor antibodies being the anti-
CEA antibodies ZCE 025 (C. M. Haskell, et al., Cancer
Research, 41, 3857 (1983), who refers to the antibody as "MAB
035") and CEM 231 (C.B. Beidler, et al., J. Immunology,
141(11), 4053 (1988)). (Regarding the ScFv(CSV_L) fragment,
when the donor murine antibody is an anti-CEA antibody, it is


further preferred that the peptide linker be composed of 
serine and glycine residues.) with the latter two anti-CEA 
donor antibodies, it is preferred that the acceptor antibody 
be the human IM9 antibody. {Reference under Bi-9 in ATCC 
» #159) wherein the framework regions in the CSV L and CDR- 
grafted light chain domains, as the case may be, are mostly 
the same in amino acid sequence as the corresponding IM9 
framework regions. The most preferred donor antibody is 
ZCE025. Finally, with regard to the ScFv(CSV L ) fragment, 
when the donor antibody is 2CE025, it is preferred that the 
peptide linker have the amino acid sequence -GGSGGSGGSGGSGG- 
(Sequence i.d. n 0 . 1) . 
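
The linker preferences stated above (about 12 to about 18 residues,
composed of serine and glycine) can be checked mechanically. The
following is a minimal sketch only; the function name and the
treatment of "about 12 to about 18" as a hard 12-to-18 window are
assumptions made for illustration and are not part of the disclosure.

```python
# Illustrative check of a ScFv(CSV L) linker against the stated preferences.
LINKER = "GGSGGSGGSGGSGG"  # Sequence I.D. No. 1, as given in the text


def linker_report(linker: str) -> dict:
    """Summarize length and composition of a peptide linker (one-letter codes)."""
    return {
        "length": len(linker),
        # Assumption for this sketch: "about 12 to about 18" is read as 12..18.
        "length_in_preferred_range": 12 <= len(linker) <= 18,
        # True when every residue is glycine (G) or serine (S).
        "only_gly_ser": set(linker) <= {"G", "S"},
    }


report = linker_report(LINKER)
print(report)
```

The same helper can be applied to any candidate linker before it is
built into a construct.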

Each of the above five preferred embodiments can
optionally have fused to its C- or N-terminus a
metal-chelating peptide sequence. The chelating peptide sequence
can be up to about twenty-five amino acid residues in length.
In the case of the CSV L and the ScFv(CSV L) fragment, only one
such peptide chain is bound to either available terminus. In
the case of the kappabody fragment and the heavybody, the
chelating peptide can be bound to either one or the other, or
both, chains, and when bound to both chains, can be bound to
either the N-termini, the C-termini, the C-terminus of one
chain and N-terminus of the other, or to both termini of both
chains. With the intact kappabody, a chelating peptide such
as that described above can be bound to any number of the
four chains comprising the molecule, with any and all
combinations of N-termini and C-termini bonding envisioned.
For any one of the five preferred constructs, it is further
preferred that the metal chelating peptide consist of about ten
amino acid residues or less and chelate to either nickel (+2),
zinc (+2), copper (+2), or cobalt (+2) ions and be bonded to one
or more, as the case may be, of the C-termini of the
molecule. More preferred is the case where one (or more) of
the C-termini is fused to a metal chelating peptide of the
sequence HWHHHP (Sequence I.D. No. 2) through the
peptide's N-terminal histidine residue. Regarding the
preferred embodiment of any of the five preferred constructs
as described above, when the embodiment is narrowed to a
murine donor antibody, it is preferred that a metal-chelating
species be bonded to the C-terminus (or possibly more than
one terminus, as is applicable), consist of ten or fewer
amino acid residues, and chelate with either nickel (+2),
copper (+2), zinc (+2) or cobalt (+2) ions. Finally, in the
most preferred embodiment of the above five constructs, that
is, wherein the donor antibody is the murine monoclonal
antibody ZCE 025, it is preferred that the optional metal
chelating peptide have the sequence HWHHHP and be fused to
the C-terminus (or one or more termini, as is applicable) of
the molecule.
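
As a small illustration of the preferred arrangement, in which the
chelating peptide is fused by its N-terminal histidine to the
C-terminus of a chain, the sketch below appends the peptide to a
chain written in one-letter code. The function name, the length
check, and the toy chain are hypothetical and serve only to make the
arrangement concrete.

```python
CHELATOR = "HWHHHP"  # Sequence I.D. No. 2, as given in the text


def fuse_c_terminal_chelator(chain: str, chelator: str = CHELATOR) -> str:
    """Append a chelating peptide to the C-terminus of a chain (one-letter codes)."""
    # Assumption for this sketch: "about ten residues or less" is read as <= 10.
    if len(chelator) > 10:
        raise ValueError("chelating peptide longer than ten residues")
    # The preferred fusion is through the peptide's N-terminal histidine.
    if not chelator.startswith("H"):
        raise ValueError("chelating peptide does not begin with histidine")
    return chain + chelator


toy_chain = "DIQMTQSPSS"  # hypothetical placeholder, not a sequence from the text
fused = fuse_c_terminal_chelator(toy_chain)
print(fused)
```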

The present invention also comprises the RNA and
DNA sequences coding for any molecule therein, including but
not limited to the five preferred constructs and their
corresponding preferred embodiments.

The present invention also comprises antigen-binding
fragments of any of the above molecules that can be
obtained by routine chemical and enzymatic manipulation, such
as the fragments resulting from the chemical cleavage of
bridging disulfide bonds (e.g., using 2-mercaptoethanol and
iodoacetate), and from enzymatic digestion with routine
reagents such as pepsin and papain. For instance, it is
within the scope of the present invention to have an F(ab')2
fragment obtained from the digestion of an intact kappabody,
and any of its preferred embodiments described above, with
pepsin, or an Fab fragment obtained from the digestion of it
with papain.

The CSV L recombinant antibodies of the present
invention contain one or more heavy chain CDR(s) from a donor
antibody grafted into a kappa or lambda chain variable
domain. The immunoglobulin chain containing the CSV L can
further contain either a kappa or lambda constant region, or
one or more alpha, delta, epsilon, gamma or mu constant
regions, depending upon its intended use. As mentioned above,
gamma constant regions are preferred for this invention, and
especially preferred are the constant regions of the gamma-1
subclass.

As used to define and delineate the scope of the
present invention, the term "CSV L recombinant antibodies"
shall mean both a CSV L fragment and a CSV L-containing
antibody or fragment thereof, including a Heavybody, an
ScFv(CSV L) fragment, an intact Kappabody, and a Kappabody
fragment.

In the various constructs of the present invention,
the antibody that provides the framework regions into which
are grafted CDRs from another antibody is referred to as the
"acceptor antibody." The antibody that provides the CDRs
grafted into the acceptor antibody is referred to as the
"donor antibody." In one embodiment, the amino acid sequences
in the four framework regions of the acceptor antibody are
substantially homologous (i.e., at least about 75% homology)
to the corresponding regions of the native acceptor
antibodies. In another embodiment, the protein sequences in
the framework regions of the acceptor antibody are altered,
for example, by means of computer modeling, to preserve
certain amino acids from the donor antibody that are
necessary to conserve the binding affinity of the CSV L
domains and the CDR-grafted light chain domain and the
ability of the hybrid immunoglobulin chains containing the
altered variable domains to associate and assemble with other
such immunoglobulin chains into antibody-like constructs.

Since a single alteration in the protein sequence
of a CDR can substantially decrease the binding affinity of
the construct for its antigen, the grafted CDRs are
preferably homologous to those of the donor antibody;
however, it is intended that one or more residues of a donor
CDR can optionally be changed or omitted. The donor and
acceptor antibodies can be polyclonal or monoclonal and can
be of any antibody class or species. Preferably, however,
the acceptor light chains are derived from a human antibody,
most preferably IgG, and the CDRs are derived from a donor
antibody from a non-human species selected from the group
consisting of rodent, rabbit, and primate antibodies. Human
donor antibodies may also be used, and in one embodiment of
the invention the CSV L recombinant antibodies are made using
the same antibody as both donor and acceptor, i.e., the heavy
chain CDRs are grafted into a kappa light chain and
associated with a native kappa light chain to make an
engineered light chain dimer fragment.

A CSV L recombinant antibody may have attached to it
an effector or reporter molecule. For instance, a macrocycle
or chelating peptide may be attached for chelating a heavy
metal atom. Similarly, a toxin, such as ricin, can be
attached to the recombinant antibodies of this invention by
any of a number of covalent binding structures known in the
art. Alternatively, a fusion protein comprising a CSV L
recombinant antibody joined by a peptide linkage to a
chelating peptide or functional non-immunoglobulin protein,
such as an enzyme or toxin molecule, can be produced using
the procedures of recombinant DNA technology, for instance,
the general methods of Neuberger, et al., in PCT Patent
Application No. PCT/GB85/00392.

The term "antigen" as used herein shall encompass
large protein antigens, such as carcinoembryonic antigen, in
addition to haptens, such as metal-binding haptens. The
ability to bind with an antigen or hapten is determined by
assays well known in the art, such as antibody capture assays
(see, for example, Harlow and Lane, Antibodies: A Laboratory
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor,
New York (1988)).
The CSV L recombinant antibodies are made using
techniques of genetic engineering that are well known in the
art. (See, for example, European Patent Application EP 0 239
400 to Winter, et al., PCT Patent Application PCT/GB91/-U08
to Adair, and U.S. Patent Nos. 5,132,405 and 5,091,513 to
Huston, et al.) The terms "CDR grafted", "grafted with", and
"grafted into", and the like, as used herein shall have the
meaning well known in the art that, using the techniques of
genetic engineering, in one antibody, called the acceptor
antibody, the CDRs are removed and replaced with those of
another antibody, usually of another species, called the
donor antibody. In the CSV L recombinant antibodies taught
herein, a CDR from the donor antibody can be grafted into a
CDR locus in the acceptor immunoglobulin other than the one
from which it is derived in the donor immunoglobulin. That
is, CDR1 in the acceptor immunoglobulin can be replaced with
CDR2 or CDR3 from a donor antibody, and so forth. The CSV L
recombinant antibodies may comprise only one or two
donor-derived CDRs, though preferably all three CDRs are derived
from the donor antibody and are grafted into the acceptor
frameworks so as to replace the native CDRs therein, i.e.,
donor CDR1 of the opposite chain is grafted into the locus of
CDR1 in the acceptor immunoglobulin chain. As used herein,
the terms "CDR" and "framework region" shall have the
meanings, and their locations shall be determined, according to
the method of Wu and Kabat, J. Exp. Med. 132:211-250 (1970),
unless crystallographic analysis or homology modeling dictate
that they have slightly modified locations.
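
At the level of the amino acid sequence, grafting a donor CDR into
an acceptor CDR locus, including a locus other than the one from
which the CDR was derived, amounts to a segment replacement. The
sketch below illustrates this on toy strings. The function name, the
0-based half-open index convention, and every sequence shown are
hypothetical; real CDR boundaries would be taken from the Wu and
Kabat definitions or from modeling, as stated above.

```python
def graft_cdr(acceptor: str, locus: tuple, donor_cdr: str) -> str:
    """Replace acceptor residues in the half-open index range `locus` with donor_cdr."""
    start, end = locus
    if not (0 <= start <= end <= len(acceptor)):
        raise ValueError("locus lies outside the acceptor sequence")
    return acceptor[:start] + donor_cdr + acceptor[end:]


# Toy strings: lowercase = framework, uppercase = CDR. All hypothetical.
acceptor = "ffffAAAAffffBBBffffCCCCCffff"
acceptor_loci = {"CDR1": (4, 8), "CDR2": (12, 15), "CDR3": (19, 24)}
donor_heavy_cdrs = {"CDR1": "XXXXX", "CDR2": "YYYYYYY", "CDR3": "ZZZ"}

# Graft donor heavy-chain CDR3 into the acceptor CDR1 locus (a switched locus).
grafted = graft_cdr(acceptor, acceptor_loci["CDR1"], donor_heavy_cdrs["CDR3"])
print(grafted)
```

Because the donor CDR may differ in length from the CDR it replaces,
the loci of any downstream CDRs shift after each graft; a fuller
implementation would graft from the C-terminal end backwards or
recompute the loci.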

As used herein the phrase "derived from" and 
"altered" shall encompass the meaning that certain amino 
acids (less than or equal to 25% and preferably less than or 
equal to 15% of the total amino acid residues) in the 
acceptor framework regions of the CDR grafted constructs are 
switched to match the corresponding amino acids from the 
donor antibody as needed to facilitate the dual goals of 
preserving the binding affinity of the donor antibody and the 
expression levels of the acceptor antibody. 
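
The percentage limits above can be applied to a pair of aligned
framework sequences by counting the positions at which the altered
framework differs from the native acceptor framework. This is a
sketch under stated assumptions: the function names are invented,
the two sequences are assumed to be of equal length and already
aligned, and the example sequences are placeholders rather than
sequences from this disclosure.

```python
def framework_substitution_fraction(acceptor_fr: str, altered_fr: str) -> float:
    """Fraction of framework positions that differ between two aligned sequences."""
    if len(acceptor_fr) != len(altered_fr) or not acceptor_fr:
        raise ValueError("sequences must be non-empty and of equal length")
    changed = sum(a != b for a, b in zip(acceptor_fr, altered_fr))
    return changed / len(acceptor_fr)


def classify_alteration(fraction: float) -> str:
    """Map a substitution fraction onto the limits stated in the text."""
    if fraction <= 0.15:
        return "preferred (<= 15%)"
    if fraction <= 0.25:
        return "within definition (<= 25%)"
    return "outside definition (> 25%)"


native = "DIQMTQSPSSLSASVGDRVT"   # hypothetical 20-residue framework stretch
altered = "DIQLTQSPSSVSASVGDRVT"  # same stretch with two substitutions
fraction = framework_substitution_fraction(native, altered)
print(fraction, classify_alteration(fraction))
```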

The CSV L recombinant antibodies of this invention 
can be engineered to have the size, function and general 
design of an intact antibody or of any antibody fragments, 
such as Fv, Fab', single chain Fv, or single domain antibody
(for example, an isolated heavy chain variable region), so 
long as each contains at least one CSV L domain.

The CSV L recombinant antibodies can be labeled for
use in in vivo diagnosis and therapy. For instance,
radioactive ions having suitable properties for use in in
vivo regimens can be attached to the recombinant antibodies
under conditions similar to those known in the art.

In the kappabody (Figure 3), the two chains are
joined by one or more, preferably one or two, sulfhydryl
bridges at the C-terminus of the light chain constant domain.
In the native kappa chain, there is one sulfhydryl bridge to
the heavy chain, but additional sulfhydryl-bearing cysteine
residues could be added by incorporating all or part of the
hinge region of an IgG heavy chain or by fusing an
appropriate metal-binding protein containing cysteine.

Kappa and lambda dimer fragments occur in nature
and result from spontaneous combination of light chains
within the host cell upon expression. Like these naturally
occurring light chain dimer fragments, those of the invention
associate naturally within the host cell and are held
together by weak bonding interactions between the two chains
(i.e., hydrogen bonding and van der Waals forces), by a
spontaneously formed disulfide bridge at the C-terminus of
the chains, as well as by any natural forces of attraction of
the heavy CDRs for the light CDRs. Unlike the naturally
occurring light chain dimer fragments, however, it is
believed that the CSV L recombinant antibodies of the
invention may experience dislocation of some of the sites of
weak bonding interaction in the kappa chains (as compared to
native kappa dimer fragments) due to strain caused by the
splicing of foreign CDRs into the acceptor kappa chains.

Therefore, in the kappabody fragments of the present
invention certain residues in the acceptor framework regions
holding the donor CDRs are preferably altered to overcome the
effects upon affinity and specificity of the foreign CDR(s)
and to ensure the ability of the engineered proteins to
properly assemble upon translation.

These small (50 kd), humanized molecules offer
several advantages over Fab antibody fragments. First, they
are readily expressed from the same vector due to uniformity
of the two chains, thus allowing for rapid construction and
more equivalent expression of both chains. Second, since
they are recombinantly expressed, the native carboxy-terminus
is present on both chains, whereas fragments created by
treatment of whole antibody with enzyme lack the native
terminus and therefore can be more immunogenic. And third,
these molecules, which have a structure distinct from Fab
antibody fragments, are expressed at high levels and are
highly stable.

It is known that during trafficking of
immunoglobulin proteins within the eukaryotic cell, the heavy
chain binds to the chaperone protein complex BiP/GRP94 located
within the rough endoplasmic reticulum, and is thereby
prevented from passage into the Golgi apparatus and thus is
prevented from secretion by the cell. A heavy chain is not
secreted in eukaryotic cells unless or until it is displaced
from the chaperone protein by a light chain, with which the
heavy chain combines, thereby leading to secretion of intact
antibody. For potentially similar reasons, a chimeric
construct comprising the variable domain of a heavy chain and
the constant region of a light chain (i.e., a V H -C L fragment)
will not be secreted by itself in mammalian host cells.

However, the instant invention discloses that a
genetically engineered gene encoding a CSV L fragment, when
operably linked to the required transcriptional and
translational sequences functional in eukaryotic host cells
suitable for expression of immunoglobulin genes, will be
transcribed, translated and secreted. The secreted CSV L
can be incorporated into constructs that also contain a light
chain constant region and will convey upon the resulting
construct a similar ability to be secreted in eukaryotic cells.

Indeed, just such a single chain fragment has been
mentioned above as a preferred embodiment of this invention.
A species of this "heavybody" fragment is depicted in
Figure 2. As with the kappabody fragment, two different
acceptor antibodies can be used, one for the light chain
constant domain and another for the FRs, although it is
preferred that the same acceptor antibody be used for both
areas. As illustrated below, the heavybody fragment is
secreted in mammalian cells as a homodimer (an assembly of
two identical chains) in the absence of the expression of
native light chain by the host cell. However, in the
presence of light chain, a light-heavy heterodimer (an
assembly of two significantly different chains) is
preferentially formed. The binding affinity of the heavybody
homodimer can readily be assayed, using methods known in the
art, such as a competition ELISA.

Unlike isolated native light chains of antibodies 
or native light chain homodimers, which do not possess 
binding affinity by themselves, the instant heavybodies 
(i.e., the single chain monomer) retain the ability to bind 
antigen. If it is desired to assay the binding affinity of 
an isolated heavybody, the sulfhydryl bridge(s) that join the
chains of the heavybody homodimer can be reduced by treatment 
with enzyme under conditions mild enough to preserve the 
binding affinity of the isolated heavybody monomer using 
techniques well known in the art, or as is illustrated in the 
Examples. The heavybody is a very small (25 kd) humanized 
molecule of different structure from a native kappa or lambda 
chain. And, unlike a chimeric heavy chain, the heavybody 
molecule is secreted from mammalian cells with high levels of 
expression. 

As one skilled in the art will appreciate, the
present invention enables production of recombinant
antibodies of smaller size. For instance, fragments
analogous to Fv fragments can be made from the variable
domains of two acceptor light chains by grafting at least one
light chain CDR into one copy of the light chain variable
region and at least one heavy chain CDR into another copy of
the light chain variable domain of the donor antibody. Like
Fv fragments, these smaller constructs lack the natural
sulfhydryl bridge that connects naturally occurring kappa
dimers (and would leave out the light chain constant domain).

It is also possible to rapidly engineer and secrete
in mammalian host cells at high expression levels a single
domain construct comprising an acceptor light chain variable
region with one or more donor antibody heavy chain CDRs
grafted between the framework regions, referred to herein as a
CSV L fragment (as discussed above), as illustrated in
Figure 1 and described further in the Examples. This
embodiment of the present invention further evidences that
the DNA sequences effective for expression of the heavybody
fragments in mammalian cells are contained in the framework
regions of the light chain variable domain.

Not all antibodies or fragments with useful
affinities for their antigen have heavy chain variable
domains with sufficient affinity to bind with the antigen.
However, by proper screening of the genome of a lymphoid
cell, a heavy chain variable domain having CDRs with
sufficient antigen affinity to bind as a single domain
antibody can be found using techniques well known in the art.
For instance, Ward, et al., in "Binding Activities of a
Repertoire of Single Immunoglobulin Variable Domains Secreted
from Escherichia coli," Nature 341:544-546 (1989), disclose a
method for screening to obtain an antibody heavy chain
variable region (V H single domain antibody) with sufficient
affinity for its target epitope to bind thereto in the single
domain format.

Alternatively, a phage expression library can be
prepared from V H DNA fragments using methods well known in
the art. (See, for instance, Garrard, L.J., et al., PCT
Patent Application PCT/US91/09133, assigned to Genentech.)
Proteins expressed on the phage head can be screened using an
affinity column having bound antigen or a polypeptide probe
constructed from the peptide sequence of the desired target
epitope or antigen. Single domain V H antibodies that bind
with the antigen can be selected and ranked to obtain those
with the highest affinity for the antigen. These single
domain V H antibodies, however, cannot be secreted in
mammalian cells. By adopting these and other techniques,
such as molecular modeling, known in the art and/or disclosed
herein, heavy chain variable domains showing antigen-binding
affinity can be obtained and used as the donor antibody to
make a single domain CSV L-containing recombinant antibody
fragment according to the present invention, i.e., having one
or more CDRs from a high affinity donor heavy chain variable
domain grafted into the framework regions of an acceptor light
chain variable domain, and preferably wherein the acceptor
antibody has kappa light chains and is of human origin.

As illustrated below in the Examples, a preferred
embodiment of the single domain fragment of this invention,
namely the CSV L fragments, can be expressed in mammalian
cells. In contrast, a conventional single domain antibody
(i.e., one consisting of a V H domain) cannot.

With a molecular weight of 12.5 kd, approximately
one sixteenth that of intact antibody, the CSV L fragments of
the invention bind to target antigen with the specificity of
the donor antibody, and with potentially greater binding
ability than the variable domain of a light chain alone. Yet
these extremely small peptides will clear from the
circulation more rapidly, with decreased normal tissue
retention and decreased immunogenicity, and penetrate tumor
more extensively than any other size of antibody fragment.
Even when the framework sequences of the CSV L fragment have
been altered in accordance with this invention to facilitate
folding of the molecule into a three-dimensional geometry
that provides the specificity and a sufficient affinity for
use in in vivo imaging and therapeutic applications, the CSV L
fragment proteins are generally approximately thirty to
thirty five percent human when three non-human CDRs have been
grafted into them. Therefore, these very small recombinant
fragments, which can be rapidly engineered to improve
affinity or specificity due to their small, single chain
format, are particularly useful for in vivo applications that
require rapid clearance of the unbound binding fragment from
the blood, such as in vivo radiotherapy using strong
beta-emitting particles attached to the binding fragment.

As mentioned above, insertion of donor CDRs into
the acceptor framework regions can displace the CDRs out of
their preferred spatial alignment. Association sites between
the heavy and light chains can also be disrupted by
introduction of the foreign CDRs so that the expression level
of the CDR grafted construct is impaired relative to that of
the intact acceptor antibody. In the CSV L recombinant
antibodies of the present invention, an additional problem is
encountered. Grafting of heavy chain CDRs into light chain
framework regions in the making of a CSV L can produce either
different or additional dislocations of the sites in the
framework regions that are necessary to support the CDRs in
their preferred spatial orientations and dislocations of the
association sites between the light and heavy chains that
contribute to assemblage of the recombinant antibody chains
during expression.

To accomplish the dual goals of (1) preserving the
spatial orientation of the CDR loops as it appears in the
donor antibody, and (2) maintaining to the greatest extent
possible the expression levels and reduced immunogenicity of
the acceptor antibody, any of a number of available methods
based on computer-assisted molecular modeling procedures can
be used or modified for effectively identifying and replacing
amino acids in the acceptor framework regions to create a CSV L
recombinant antibody of this invention.

For instance, Adair, J., et al., PCT Patent
Application PCT/GB90/02017, assigned to Celltech, disclose a
method for introducing mutations into acceptor framework
regions of CDR-grafted antibody chains of anti-CEA antibodies
to match the corresponding donor residues. In the Celltech
method, in the light chain, in addition to the Kabat-defined
CDRs from the donor antibody (CDR1: positions 24-34; CDR2:
positions 50-56; CDR3: positions 89-97), the structural loop
residues (positions 89-97) in CDR3 and residues at one or more
of positions 1, 2 and/or 3, 46, 47, 49, 60, 70, 84, 85 and 87
are replaced by the corresponding donor residues, if they
differ.
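
The position lists quoted above lend themselves to a simple lookup.
The sketch below records the Kabat CDR ranges and the additional
framework positions as stated in the preceding paragraph and reports
which of those framework positions differ between an acceptor and a
donor. The names, the dictionary representation keyed by position
number, and the toy residues are assumptions for illustration; Kabat
insertion letters (for example 27A) are ignored for simplicity.

```python
# Kabat-defined CDR ranges quoted in the text (inclusive positions).
KABAT_LIGHT_CDRS = {
    "CDR1": range(24, 35),  # positions 24-34
    "CDR2": range(50, 57),  # positions 50-56
    "CDR3": range(89, 98),  # positions 89-97
}
# Additional framework positions named in the text for possible replacement.
EXTRA_FRAMEWORK_POSITIONS = (1, 2, 3, 46, 47, 49, 60, 70, 84, 85, 87)


def region_of(position: int) -> str:
    """Return the CDR name for a Kabat position, or 'framework'."""
    for name, positions in KABAT_LIGHT_CDRS.items():
        if position in positions:
            return name
    return "framework"


def positions_to_replace(acceptor: dict, donor: dict) -> list:
    """List the named framework positions at which acceptor and donor differ."""
    return [
        p for p in EXTRA_FRAMEWORK_POSITIONS
        if p in acceptor and p in donor and acceptor[p] != donor[p]
    ]


# Hypothetical residues keyed by Kabat position (not taken from the text).
acceptor_residues = {46: "L", 47: "L", 49: "Y", 60: "S"}
donor_residues = {46: "L", 47: "W", 49: "Y", 60: "D"}
print(positions_to_replace(acceptor_residues, donor_residues))
```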

According to the Celltech method, in the heavy
chain, in addition to the Kabat-defined CDRs, the amino acid
residues of the acceptor variable domain are replaced at
positions 23 and 24 and 71 and/or 73 with those of the donor
antibody, if they differ. Additionally, in the heavy chain,
the acceptor residues can be replaced by donor residues at
some or all of positions 48 and/or 49, 69, 76 and/or 78, 80,
88 and/or 91 and 96. The definitions of the CDRs can also be
shifted to accommodate idiosyncratic regions in any given
donor antibody.

Ideally, commercially available computer programs
are used with actual crystal structures of the donor and
acceptor antibodies (bound to their antigens) to determine
which amino acids in the CDRs (and framework regions) contain
atoms that are close enough to atoms in the amino acids of
the antigen to interact.

Yet another method, generally referred to as
homology modeling, is useful when a crystal structure cannot
be obtained for the antibody to be used in making the
antibodies of this invention. Several fully automated
algorithms to align crystal structures and define
structurally conserved regions are known. The loop regions
are modeled by two basic methods: 1) use of a database of
available structures to provide the best possible loop
conformations, or 2) use of a distance-geometry based
mathematical model to generate further possible conformers.
The best conformer generated by either method of modeling is
chosen on the basis of some type of scoring function, usually
an energy calculation. For instance, computer programs such
as Insight II, Homology and Discover (Biosym, San Diego, CA)
are employed in conjunction with a database containing the
known crystal structures of proteins, such as the Brookhaven
Protein Data Bank, to construct a three dimensional
representation of the immunoglobulin of interest. This
three dimensional representation is based upon homology
between structurally conserved regions (SCRs) of the known
structures and corresponding regions in the protein whose
crystal structure is unknown. A loop search algorithm is
used to identify protein loops from the database with the
right number of residues and correct three dimensional
disposition of backbone atoms of the regions flanking the
loop to splice between the structurally conserved regions.
In this way the three dimensional model of the donor and
acceptor antibodies is constructed by the computer so that
the residues of the donor antibody frameworks that are
involved in supporting the CDRs and the residues of the
acceptor antibody frameworks that are involved in chain
association can be conserved in the CSV L recombinant
antibody.

The preferred method of making the CSV L
recombinant antibodies of this invention, when actual crystal
structures of the donor and acceptor antibodies are not
known, employs molecular modeling. Molecular modeling can be
used to locate the three dimensional structurally conserved
regions (SCRs) common among all antibodies. Separate
computer models of the donor and acceptor immunoglobulins are
constructed by a technique of homology modeling based upon a
database of known protein crystal structures, such as the
Brookhaven Protein Data Bank of known protein crystal
structures, using the computer modeling programs Insight II,
Homology and Discover, Version 2.1.2. From computer models
of the donor and acceptor antibodies, the amino acid residues
in each structure involved in association of the
immunoglobulin chains in the acceptor antibody are determined
and conserved in the CDR grafted construct. In addition, the
amino acid residues involved in support of the CDRs in the
donor antibody are conserved in the CSV L recombinant
antibodies.

Briefly, for the purpose of modeling the light
chain variable region of an antibody, at least two and
preferably at least eight antibodies are selected from a
protein database, such as the Brookhaven Protein Data Bank,
that provides both a linear amino acid sequence and three
dimensional atomic coordinates of each antibody variable
region. The sequences and structures of these antibodies are
manipulated by a computer program having the ability to
assign the corresponding atomic coordinates from a segment of
a known structure to the atoms of any segment of an amino
acid sequence having the same number of residues. One
skilled in the art will know of computer programs and
databases that are suitable to work in tandem in this
fashion. For example, the Brookhaven Protein Data Bank can
be used together with the current versions of molecular
modeling programs Insight II, Homology and Discover (Biosym
Technologies, Inc., San Diego, CA), as discussed in the
immediately following sections.

Step One - Definition of Structurally Conserved Regions

Beginning Point: Known Three-Dimensional

Structures of Antibodies

Using the selected three dimensional protein
structures and sequences from the database, the operator uses
the computer program to align the sequences of the variable
regions and to superimpose the corresponding structures so
that structurally conserved regions can be identified. For
instance, the sequences are aligned in a linear array, with
each sequence constituting one row of the array, i.e., Seq a,
Seq b, Seq c, etc.

To facilitate alignment by placing the SCRs into
columns using the Insight II software, certain landmark amino
acids known to be universally conserved among antibodies,
such as the cysteines that form the intrachain disulfide
bridge, are identified in each sequence and are aligned in
vertical columns. Taking the first two of the linearly
aligned sequences, one, for instance Seq a, is designated to
be held constant and the other, for instance Seq b, to be
superimposed onto the first. (In practice, the bottom
sequence on a computer display is usually most convenient to
hold constant.)

Three-dimensional alignment of the known structures 
is further refined by discovering additional amino acid 
sequences that correspond to regions in all the selected 
antibodies that preserve almost identically the same three- 
dimensional conformation, called herein the structurally 
conserved regions (SCRs).

Using the already superimposed structures, the
first putative SCR, conveniently designated SCR1ab, is
discovered by visual inspection. Preferably, successive SCRs
are identified by working from the amino to the carboxy
terminus of the molecules. The RMS deviation of the backbone
atoms in the two segments of amino acids corresponding to
SCR1ab is calculated. The exact location of SCR1ab, and
hence of the amino acids contained within the segments
corresponding to SCR1ab, is adjusted by a procedure of
trial and error whereby the amino acids in the linear
sequences of the array that correspond to those in the
putative SCR1ab are boxed and the RMS deviation is
calculated. The width of the box is maximized and the
location of the box is adjusted until the RMS deviation
reaches an acceptable maximum, for instance no more than
about 0.75 Angstroms.
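
The trial-and-error boxing procedure described above can be
illustrated with a short calculation. The sketch below computes the
RMS deviation between two already superimposed segments and widens a
box one residue at a time for as long as the deviation stays at or
below the cutoff. It is a simplified illustration, not the procedure
of the modeling software named in the text: it assumes one backbone
atom per residue, residue-by-residue correspondence without gaps,
and a greedy widening rule, and all names and coordinates are
hypothetical.

```python
import math


def rms_deviation(coords_a, coords_b):
    """RMS distance between paired (x, y, z) coordinates of equal-length lists."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(total / len(coords_a))


def widen_box(coords_a, coords_b, start, end, cutoff=0.75):
    """Greedily grow the half-open box [start, end) while RMS deviation <= cutoff."""
    if rms_deviation(coords_a[start:end], coords_b[start:end]) > cutoff:
        raise ValueError("initial box already exceeds the cutoff")
    grew = True
    while grew:
        grew = False
        # Try extending one residue towards the amino terminus.
        if start > 0 and rms_deviation(
            coords_a[start - 1:end], coords_b[start - 1:end]
        ) <= cutoff:
            start -= 1
            grew = True
        # Try extending one residue towards the carboxy terminus.
        if end < len(coords_a) and rms_deviation(
            coords_a[start:end + 1], coords_b[start:end + 1]
        ) <= cutoff:
            end += 1
            grew = True
    return start, end


# Hypothetical data: structure b deviates from a by the listed amounts (Angstroms).
deviations = [3.0, 0.2, 0.1, 0.1, 0.2, 0.3, 2.5, 4.0]
structure_a = [(3.8 * i, 0.0, 0.0) for i in range(len(deviations))]
structure_b = [(x, y + d, z) for (x, y, z), d in zip(structure_a, deviations)]
box = widen_box(structure_a, structure_b, 2, 4)
print(box)
```

With the toy deviations shown, the box grows from residues 2-3 to
residues 1-5 and stops where including either flanking residue would
push the RMS deviation above 0.75 Angstroms.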

To ensure that spatial alignment of SCR1ab at the
amino terminus of the first and second structures is not
destroyed by establishment of subsequent SCRs along the
sequences (i.e., SCR2ab and SCR3ab, etc.), preferably after
the process has been carried out to define SCR2ab, the two
structures are superimposed again using the residues for the
backbone atoms in SCR1ab as well as SCR2ab. This process is
repeated for each subsequent SCR. Gaps, for example, empty
space holders, can be inserted within nonconserved
(nonhomologous) regions, referred to herein as NSCRs.
Usually the NSCRs are found in the loops and CDRs. Gaps are
inserted as needed to accomplish vertical alignment of the
SCRs, for example, where any sequence had fewer amino acids
between the SCRs than did the other.
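
The gap insertion described above can be sketched as padding each
nonconserved stretch to the width of the longest corresponding
stretch, so that every SCR falls in the same columns of the array.
The function name, the use of "-" as the empty space holder, and the
toy sequences (uppercase for SCR residues, lowercase for NSCR
residues) are assumptions for illustration; the sketch also assumes
that each SCR has the same length in every sequence.

```python
def align_by_scrs(seqs, scr_bounds, gap="-"):
    """Pad NSCR stretches with gaps so that each SCR occupies the same columns.

    seqs       -- list of sequences (strings)
    scr_bounds -- per sequence, a list of half-open (start, end) index pairs,
                  one pair per SCR, in amino-to-carboxy order
    """
    rows = ["" for _ in seqs]
    cursors = [0 for _ in seqs]
    for k in range(len(scr_bounds[0])):
        # Nonconserved stretch preceding SCR k in each sequence.
        nscrs = [s[c:b[k][0]] for s, c, b in zip(seqs, cursors, scr_bounds)]
        width = max(len(n) for n in nscrs)
        for i, nscr in enumerate(nscrs):
            start, end = scr_bounds[i][k]
            rows[i] += nscr.ljust(width, gap) + seqs[i][start:end]
            cursors[i] = end
    # Pad whatever follows the last SCR.
    tails = [s[c:] for s, c in zip(seqs, cursors)]
    width = max(len(t) for t in tails)
    return [row + t.ljust(width, gap) for row, t in zip(rows, tails)]


seq_a = "mkAAAAlpqrBBBBt"  # hypothetical: SCRs at [2, 6) and [10, 14)
seq_b = "mAAAAlpBBBBtv"    # hypothetical: SCRs at [1, 5) and [7, 11)
aligned = align_by_scrs(
    [seq_a, seq_b],
    [[(2, 6), (10, 14)], [(1, 5), (7, 11)]],
)
for row in aligned:
    print(row)
```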

Usually, by the method of this invention, from about
seven to ten SCRs are established between any two light chain
variable regions, with each SCR containing from about three
to twenty amino acids from each of the structures, when the
RMS deviation of the backbone atomic coordinates in the SCRs
is no more than about 0.75 Angstroms.

Once two of the structures have been aligned in
this manner, the procedure is repeated, preferably by
selecting the first structure, for instance the bottom
structure in the array, to be held constant (Seq a), and
discovering the SCRs between that first structure and each in
turn of the other structures represented in the linear array
(Seq b, Seq c, etc.) to yield SCR1ac, SCR2ac, SCR3ac, etc.
and then SCR1ad, SCR2ad, SCR3ad, etc. Alternatively, of
course, any other method can be used whereby segments having
a common spatial conformation, such as SCRs, are located
within the known three dimensional structures of from six to
ten antibody variable regions. For instance, one skilled in
the art will appreciate that it would be possible to locate
the first SCRs in the middle of the molecules and work
outward therefrom in either direction, or to begin at the
carboxy terminus of molecules and work progressively towards
the amino terminus. The order in which the sequences (and
their structures) are compared with one another can also be
varied. For instance, one skilled in the art will appreciate
that it would be possible not to hold a first structure
constant, and instead to align any two structures and then to
choose any one of those two structures to be aligned with a
third, and so forth.

When all of the structures have been compared with
one another by any of the alternative methods described
above, for instance when each structure in the array has been
in turn superimposed and aligned with the constant first
structure, as is preferred herein, the next step is to
identify the consensus SCRs. A consensus SCR comprises the
residues in each linear sequence that are in the intersection
of all of the individual SCRs. One skilled in the art will
appreciate, however, that the technique of locating the
consensus SCRs can be varied so long as structurally
conserved regions (SCRs) common to all of the structures in
the array are located, and so long as the RMS deviation of
the coordinates corresponding to the superimposed backbone
atoms in all of the structures is acceptably low, for
instance no more than 0.75 Angstroms.
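
Taking the intersection of the individual SCRs can be illustrated with a short sketch. It rests on assumptions not stated in the text: each pairwise comparison (a-b, a-c, and so on) is taken to yield the same number of SCRs in the same order, and each SCR is written as a half-open range of column numbers in the aligned array. The name `consensus_scrs` is hypothetical.

```python
def consensus_scrs(pairwise_scrs):
    """Intersect corresponding SCR boxes from several pairwise comparisons.

    pairwise_scrs: one list per comparison (a-b, a-c, ...), each holding
    (start, end) half-open column ranges in the aligned array, in the
    same SCR order. Returns the columns common to every comparison.
    """
    consensus = []
    for boxes in zip(*pairwise_scrs):
        start = max(box[0] for box in boxes)  # latest start
        end = min(box[1] for box in boxes)    # earliest end
        if start < end:                       # keep only non-empty overlaps
            consensus.append((start, end))
    return consensus
```

For instance, the boxes (2, 10), (3, 11) and (1, 9) found for the first SCR in three comparisons would give the consensus box (3, 9).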

A similar procedure is followed to locate and fix 
in spatial relation to one another the SCRs common to the 
heavy chain variable domains of antibodies, except that the 
sequences used in the linear array are those of the heavy 
chain variable domains of the antibodies in the database
whose three dimensional structures are known. 

Step Two - The Three-dimensional Modeling of Acceptor Fv
and Identification of Chain Association Residues

Now the linear sequence of the acceptor antibody
chain to be modeled is displayed as an additional row in the
linear array and aligned with the sequences of the eight
database antibodies as described above to discover the
segments of SCRs in the acceptor chain that correspond to
the consensus SCRs. Gaps are inserted as needed to
accomplish vertical alignment. This process is identical
for light and heavy chains. The three-dimensional model of
the acceptor antibody chain can now be fabricated in
segments from the consensus SCRs derived above. For each
SCR in the linear sequence of the acceptor chain, e.g.,
SCR1, the column of corresponding database SCR1s is
inspected to find the SCR1 having the greatest sequence
homology to the acceptor SCR1. The computer is used to
construct the model of the acceptor SCR1 by assigning to
each residue in the acceptor SCR1 coordinates corresponding
to those of the selected sequence from the column of
corresponding database SCR1s. At this point, any residue in
the selected SCR1 that does not match the corresponding
residue in the acceptor SCR1 is mutated to match the residue
in the acceptor SCR1, while the coordinates of all the atoms
in the backbone and sidechains that correspond to those in
the acceptor residue are conserved. The remaining atoms are
modeled under the constraints of maintaining the same bond
lengths, angles and dihedrals as those in the original
database residue, i.e., for the gamma and delta carbons. The
process is repeated for each of the subsequent SCRs, i.e.,
SCR2, SCR3, etc.

Next, the length of each segment of NSCR in the
acceptor chain sequence, i.e., the spanning sequence between
each successive pair of boxes, is determined. Progressively
from the amino terminus of the chain, NSCR segments of the
acceptor chain are modeled by selecting loops from the
protein database to span between the endpoints of the SCRs of
the acceptor chain model constructed above. The actual
number of amino acid residues in each NSCR is counted
(ignoring the space-filling gaps used to accomplish vertical
alignment). For each span individually, the computer is
instructed to search the protein database, for instance using
the Loop Search algorithm as is well known in the art, to
discover from about eight to twelve candidate amino acid
sequences having (1) the same number of amino acids as the
actual acceptor NSCR and (2) flanking regions with the same
relative atomic coordinates as the flanking SCRs in the
acceptor chain model as determined above. As one skilled in
the art will appreciate, depending on local structural
details, either all or some subset of the residues adjacent
to the loop in each SCR box can be identified as the flanking
residues. The candidate sequences whose flanking regions are
best fits with the relative atomic coordinates of the SCRs of
the acceptor chain model, as determined by computer
algorithm, are selected.
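
The two selection criteria for loop candidates (equal length and best-fitting flanks) can be sketched as a simple filter and ranking. This is an illustration under stated assumptions, not the Loop Search algorithm itself: the candidate flanking atoms are assumed to have been placed in the coordinate frame of the model beforehand, and the fragment records and the name `select_loop_candidates` are hypothetical.

```python
import math


def flank_rms(flank_a, flank_b):
    """RMS deviation between two equal-length lists of (x, y, z)
    coordinates of flanking backbone atoms."""
    total = sum(
        (a - b) ** 2
        for p, q in zip(flank_a, flank_b)
        for a, b in zip(p, q)
    )
    return math.sqrt(total / len(flank_a))


def select_loop_candidates(fragments, loop_length, model_flanks, keep=10):
    """Return the names of the `keep` best database fragments.

    fragments: iterable of (name, loop_sequence, flank_coords) tuples.
    A fragment qualifies only if its loop has exactly `loop_length`
    residues; qualifying fragments are ranked by how closely their
    flanking atoms match the flanking SCR atoms of the model.
    """
    scored = []
    for name, loop_sequence, flank_coords in fragments:
        if len(loop_sequence) != loop_length:
            continue  # criterion (1): same number of amino acids
        if len(flank_coords) != len(model_flanks):
            continue  # flanks must be comparable atom for atom
        scored.append((flank_rms(flank_coords, model_flanks), name))
    scored.sort()     # criterion (2): best flank fit first
    return [name for _, name in scored[:keep]]
```

The default of ten retained candidates falls within the range of about eight to twelve given in the text.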

It has been discovered that in antibodies the
general spatial conformation of the loops and NSCRs is
conserved. Therefore, the best candidate for NSCR1,2 in the
model should have a three dimensional spatial conformation
generally similar to that of the corresponding NSCR1,2
segments in the antibody database structures. For each
candidate that meets this general requirement, the backbone
atoms of the flanking residues of the candidate NSCR are
superimposed on the backbone atoms of the corresponding
flanking residues of the SCRs of the model that flank the
NSCR under consideration. For example, to consider the
candidates for the NSCR1,2 position in the model, the
backbone atoms of the flanking residues of the candidate NSCR
are superimposed on the backbone atoms of the corresponding
flanking residues of the SCR1 and SCR2 sequences from the
model, and the candidate having (1) the best RMS fit of the
backbone atoms of its flanking residues with backbone atoms
of the corresponding flanking residues from SCR1 and SCR2 and
(2) a spatial orientation most like that of the NSCR1,2s of
the database antibodies displayed on the computer screen (to
rule out interference with other loops) is selected. By
repeating this procedure at each NSCR position, i.e., at
NSCR1,2; NSCR2,3; NSCR3,4; etc., the acceptor NSCRs are
selected and then placed into the acceptor model as follows.

Once the best spatial orientation for an amino acid
sequence of the given loop length for each NSCR is selected,
the coordinates of the backbone of the candidate segment are
assigned by the computer to the corresponding NSCR in the
model. Now any residue in the selected candidate sequence
NSCR dissimilar to the corresponding residue in the actual
sequence of the acceptor NSCR is mutated to match the
acceptor sequence while the computer algorithm is used to (1)
maintain the coordinates of all the atoms common between the
two, and (2) model the dissimilar atoms while constraining
the bond lengths, angles and dihedrals to those of the
candidate residue.

Once all of the NSCRs making up the model are in
turn selected from the database, fixed in space, and modeled
to transform them into the coordinates of the corresponding
acceptor NSCRs, the splice regions where the SCRs join the
NSCRs are preferably refined to relieve any strain in the
model that results from joining the SCRs and NSCRs. This
refinement can be accomplished using any suitable computer
algorithm, for instance the "Repair" algorithm in Insight II,
to assign the proper bond lengths, bond angles, and omega
values to the residues at the splice junctions.

Now, the model as a whole is relaxed using a
suitable computer algorithm to relieve any strain occasioned
by the above procedures. Preferably the "Relax" algorithm of
Insight II is applied in a series of sequential steps to the
model as a whole. Preferably, the order of the steps is to
apply the algorithm: (1) to the side chains of the NSCRs to
assign proper geometries and remove any unfavorable
non-bonded contacts between side chain atoms and other atoms
in the molecule, (2) to all atoms of the NSCRs to remove any
remaining unfavorable contacts between the NSCR and other
atoms in the molecule, (3) to the mutated side chains of the
SCRs to remove any unfavorable non-bonded contacts between
mutated SCR side chain atoms and other atoms in the molecule,
and (4) to all of the side chain atoms of the SCRs to remove
the remaining unfavorable sidechain contacts.

Finally, an energy minimization procedure is
performed using techniques well known in the art, for
instance, using the "Discover" subprogram of Insight II, to
allow the model to assume an energetically favorable
conformation. In the preferred embodiment, however, the
energy minimization is performed in a series of sequential
steps. The entire model is first subjected to energy
minimization with backbone atoms tethered to their starting
coordinates with a force constant of 100 kcal/Å². Then an
energy minimization is performed for the entire model without
the backbone atoms being tethered. The result of carrying
out these steps is a model of the variable domain of each of
the acceptor chains.

In the method of this invention, the model of the
acceptor Fv is made by the following steps: (1) identify
potential chain association residues by comparison of the
sequence of the acceptor chain with the linear array of known
structures and select an appropriate known structure to use
in modeling chain association of the acceptor molecule, (2)
make a preliminary model by superimposing the backbone atoms
of the potential chain association residues of the selected
known structure, (3) subject the entire molecule to energy
minimization, first, with the backbone atoms being tethered
to their initial coordinates and, second, without the
backbone atoms being tethered, and (4) identify the chain
association residues in the final acceptor Fv model,
excluding all residues that are part of a CDR.

In the first step, in one chain of the structures
of each of the database antibodies, each residue in the
variable region of that chain having an atom within 4.5
Angstroms of an atom in a residue in the other chain is
identified. If the residues so identified in each database
antibody are not part of a CDR and are likely to have a
significant interaction with residues in the other chain,
they are earmarked in the linear sequence of the antibody as
chain association residues. The process is repeated for the
other chain of each database antibody.

Since the residues involved in chain association
are highly conserved among antibodies, it can be assumed
that there is homology between chain association
residues in the VL and VH of the database antibodies and
those in the VL and VH of the acceptor antibody or
immunoglobulin. Therefore, when a residue in the acceptor
sequence is found to be identical to one earmarked in the
array, it is earmarked as a chain association residue in the
acceptor model. On the other hand, when an amino acid is
found in the acceptor that differs from the corresponding one
in the database antibody in any of the positions in the
database sequences earmarked as chain association residues,
it is designated as potentially disruptive to chain
association. Each database antibody is compared with the
acceptor molecule. The database antibody with the greatest
excess of favorable residues over disruptive residues is
chosen.
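
The choice of the database antibody with the greatest excess of favorable over disruptive residues can be sketched as a simple tally. The sketch assumes that the acceptor and each database sequence are already aligned column for column and that the earmarked chain association positions are given as column indexes; the names `association_score` and `choose_template` and the data layout are hypothetical.

```python
def association_score(acceptor, template, earmarked):
    """Excess of favorable over disruptive positions.

    A position earmarked as a chain association residue in the template
    counts as favorable when the acceptor carries the identical residue
    there, and as potentially disruptive when it differs.
    """
    favorable = sum(1 for i in earmarked if acceptor[i] == template[i])
    disruptive = len(earmarked) - favorable
    return favorable - disruptive


def choose_template(acceptor, database):
    """Pick the database antibody with the greatest excess.

    database: dict mapping a name to (aligned_sequence, earmarked_positions).
    """
    return max(
        database,
        key=lambda name: association_score(acceptor, *database[name]),
    )
```

With ties, `max` returns the first entry encountered, so an operator would still want to inspect near-equal candidates by eye.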

In the second step, superimposition is accomplished
using a program such as the "Superimpose" command in Insight
II.

In the third step, a program such as "Discover" in
Insight II is used to carry out the energy minimization, with
the backbone atoms being tethered to their initial
coordinates with a force constant (usually 100 kcal/Å²) for
the initial minimization and with no tethering for the final
minimization.

In the fourth step, chain association residues in
the light chain are identified as all residues from the light
chain that contain an atom within a specific distance of any
atom of any residue in the heavy chain selected as indicating
possibility of significant interaction therebetween (usually
about 4.5 Å). Similarly, chain association residues in the
heavy chain are identified as all residues from the heavy
chain that contain an atom that is within a specific distance
of any atom of any residue in the light chain selected as
indicating possibility of significant interaction
therebetween (usually about 4.5 Å).
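
The distance test for chain association residues lends itself to a direct sketch. The version below is illustrative only: it assumes each chain is supplied as a mapping from a residue label to the coordinates of its atoms, applies the usual 4.5 Angstrom cutoff, and lets the caller name CDR residues to be skipped. The function name and the residue labels used with it are hypothetical.

```python
import math


def chain_association_residues(chain_a, chain_b, cutoff=4.5,
                               exclude=frozenset()):
    """Residues of chain_a having any atom within `cutoff` Angstroms of
    any atom of any residue in chain_b.

    chain_a, chain_b: dict mapping residue label -> list of (x, y, z).
    exclude: labels to skip, e.g. residues that are part of a CDR.
    """
    hits = []
    for label, atoms in chain_a.items():
        if label in exclude:
            continue
        close = any(
            math.dist(p, q) <= cutoff
            for p in atoms
            for other_atoms in chain_b.values()
            for q in other_atoms
        )
        if close:
            hits.append(label)
    return hits
```

Calling the function once with the light chain first and once with the heavy chain first gives the two sets of residues. The distance test only flags candidates; the judgment of whether an interaction is significant remains with the operator.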

Step Three - The Three-dimensional Modeling of
Donor Fv and Identification of CDR-Associated Residues

Models of donor Fv are arrived at in a manner 
identical to that described above for the acceptor Fv. 

CDR-associated residues are identified after
minimization by determining those residues containing an atom
within a specific distance of any atom of any residue found
within a CDR selected as indicating the possibility of
interaction therebetween (usually about 4.5 Å). These
residues are defined as CDR-associated residues and are
treated in a step in the humanization process described in
Step 4 below.

Step Four - The Three-dimensional Modeling of CDR-grafted Fv

The CDR-associated residues determined above are
now identified in the primary amino acid sequence of the
donor molecule, and the primary sequences for the altered
light and heavy chain CDR-grafted molecules are pieced
together in segments.

First, the primary amino acid sequences of the
donor and acceptor molecules are aligned with reference to
the sequences of the known database structures. Second, on
the donor linear array: (1) the CDR-associated residues
determined above are identified, (2) for SCRs or NSCRs that
do not contain a CDR residue or a CDR-associated residue, the
sequence of the entire segment is replaced with the sequence
of the corresponding segment of the acceptor molecule, (3)
for SCRs that contain one or more CDR residues or CDR-
associated residues, the residues that are neither CDR nor
CDR-associated in the segment are replaced with those of the
acceptor molecule, but the CDR residues and CDR-associated
residues are conserved as the donor residues, (4) in NSCRs
that contain CDR residues or CDR-associated residues, if the
total number of residues in the NSCR differs between the
donor and acceptor, the entire NSCR is conserved as the
donor sequence, (5) in NSCRs that contain CDR residues or
CDR-associated residues, if the total number of residues in
the NSCR is the same between the donor and acceptor, those
residues that are neither CDR nor CDR-associated are
replaced with those of the acceptor molecule while the CDR
residues and CDR-associated residues are conserved as the
donor residues. Thus, in all cases, CDR residues and
CDR-associated residues in SCRs or NSCRs are conserved as
the donor residues.
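
The segment-by-segment rules for piecing together the grafted sequence can be expressed compactly. The sketch below follows the rules as read here, including the reading that an NSCR containing CDR or CDR-associated residues is kept wholly as donor when its length differs from the acceptor NSCR. Segments are plain one-letter strings, and `keep` holds the donor positions that are CDR or CDR-associated; the name `graft_segment` and this data layout are hypothetical.

```python
def graft_segment(kind, donor, acceptor, keep):
    """Sequence of one segment of the altered CDR-grafted chain.

    kind:     "SCR" or "NSCR".
    donor:    donor residues of the segment (one-letter codes).
    acceptor: acceptor residues of the corresponding segment.
    keep:     set of indexes into `donor` that are CDR residues or
              CDR-associated residues.
    """
    if not keep:
        # No CDR or CDR-associated residue: take the acceptor segment whole.
        return acceptor
    if kind == "NSCR" and len(donor) != len(acceptor):
        # Loop lengths differ: conserve the entire NSCR as donor.
        return donor
    # Same length: acceptor residues everywhere except at the CDR and
    # CDR-associated positions, which stay donor.
    return "".join(
        d if i in keep else a
        for i, (d, a) in enumerate(zip(donor, acceptor))
    )
```

Concatenating the results for successive segments, from the amino to the carboxy terminus, gives the primary sequence of the grafted chain.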

Third, the donor and acceptor models are
superimposed. Once the two models are brought up on the
computer screen, SCRs are determined. In this step SCRs are
derived in a way distinct from that used in construction of
the acceptor and donor models. In the latter case the SCRs
were assigned to the donor and acceptor based on the
consensus SCRs determined from the known structures. In this
step, SCRs are determined anew from the two models alone in a
manner analogous to that used to determine the SCRs between
each of the known structures, as described in Step 1 above
(wherein the acceptor was designated to be held constant and
the donor was superimposed upon it).

Using the modeled three dimensional structures and
sequences for the acceptor and donor Fvs, the operator uses
the computer program to align the sequences for the Fvs and
to superimpose the corresponding structures so that SCRs can
be identified. For instance, the sequences are aligned in a
linear array with each sequence constituting one row of the
array, i.e., seqA (for acceptor) and seqD (for donor). To
facilitate alignment using the Insight II software, certain
landmark amino acids known to be conserved among antibodies,
such as the cysteines that form the intrachain disulfide
bridge (i.e., the light chain cysteines at L23), are
identified in each sequence and are aligned in vertical
columns, as described in Step 1 above.

Three dimensional alignment of the two structures
is further refined by identifying SCRs and superimposing
them. Using the already superimposed structures, the
putative SCR1AD is discovered by visual inspection.
Preferably, successive SCRs are identified by working from
amino to carboxy terminus of the molecules. The RMS
deviation of the backbone atoms in the corresponding segments
of amino acids in the two structures is calculated. The
exact location of SCR1AD, and hence of the amino acids
contained within the segments corresponding to SCR1AD, is
adjusted by a procedure of trial and error whereby the amino
acids in the linear sequences of the array that correspond to
those in the putative SCR1AD are boxed and the RMS deviation
is calculated. The width of the box is maximized and the
location of the box is adjusted until the RMS deviation
reaches an acceptable maximum, for instance no more than
about 0.75 Å.

To ensure that spatial alignment of SCR1AD at the
amino terminus of the two structures is not destroyed by
establishment of subsequent SCRs along the sequences (i.e.,
SCR2AD, SCR3AD, etc.), after the process has been carried out
to define SCR2AD, the two structures are superimposed again
using the residues for the backbone atoms in SCR1AD as well
as SCR2AD. This process is repeated for each subsequent SCR.

Gaps, for example empty space holders, can be
inserted within NSCRs as needed to accomplish vertical
alignment of the SCRs. For example, where any sequence has
fewer amino acids between the SCRs than does the other, gaps
can be used to make the two of equal length.

Each segment in the altered CDR-grafted chain is
assigned spatial coordinates that correspond to those of the
donor or acceptor residue to which it corresponds.
Preferably this is done working from the amino to the carboxy
terminus of the chain.

Now the light and heavy chain minimized models
constructed above are displayed on the computer screen
together, and an energy minimization is performed to
allow this Fv model to assume an energetically favorable
conformation using the steps described above.

As a final check, the model is examined to
determine whether any new CDR-associated residues appear in
the altered CDR-grafted model using the techniques described
above. If any new CDR-associated residue is seen in the
altered CDR-grafted (and humanized) model, the amino acid at
that position is replaced by the one found in the donor.

After the CDR-associated residues are modified as
necessary, the model is analyzed to determine whether all the
chain association sites identified in the acceptor model have
been conserved in the altered CDR-grafted model. If
differences are observed, they should be noted as possible
future sites for mutagenesis if a significant decrease in
secretion of the altered CDR-grafted protein is observed as
compared to that of the acceptor molecule.

Step Five - The Three-dimensional Modeling
of CSV L CDR-grafted Fv

The acceptor light and donor heavy chain primary
amino acid sequences had already been aligned with reference
to different sequences. Therefore, it was necessary to
bridge these alignments through realignment using a common
sequence. In addition, the acceptor heavy chain provided
information on chain association residues. Donor heavy chain
sequence was added to a linear array containing light chain
donor and light and heavy chain acceptor sequences and
aligned. Once aligned in this manner, SCRs were defined
therebetween as described in Step One. The Kabat-defined
CDRs and CDR-associated residues determined in Step Three
were identified on the donor heavy chain linear array. For
SCR or NSCR regions which do not contain a CDR or CDR-
associated residue, the entire region was replaced with the
acceptor light chain sequence (and structure, i.e.,
coordinates). For SCRs which contain one or more CDR or CDR-
associated residues, the non-CDR-associated residues were
replaced with acceptor sequence (and structure, i.e.,
coordinates), but donor heavy chain sequence (and structure,
i.e., coordinates) was conserved for the CDR-associated
residues. For NSCRs that contain one or more CDR or CDR-
associated residues, the donor heavy chain sequence (and
structure, i.e., coordinates) was conserved for the entire
region. In this way the primary sequence for the heavy chain
CDR-grafted molecule was determined, and a composite
structure was developed.

Now, the resultant model was modified to assure
that chain association residues, derived from the acceptor
model, were conserved in all non-CDR or non-CDR-associated
regions, when the amino acid in the position occupied by the
chain association residue was different than the
corresponding acceptor heavy chain chain-association residue.
In this example, no chain association residues were found to
lie in CDR or CDR-associated regions. In the unlikely
event that this should occur, the residue should be noted,
but no change should be made.

Alternatively, humanized light chain can be used as
acceptor and humanized heavy chain can be used as donor. In
this case, chain association residues used for the
preliminary Fv model are those identified for humanized Fv.

Now that coordinates had been assigned for both
light and heavy/light hybrid chains, these were displayed on
the computer screen together. An energy minimization was
performed using the Discover subprogram to allow the model to
assume an energetically favorable configuration. First the
entire model was subjected to energy minimization with
backbone atoms tethered to their starting coordinates with a
force constant of 100 kcal/Å². Then the energy minimization
algorithm was applied to the entire model without the
backbone atoms being tethered.

CDR-associated residues were determined for the
humanized Fv. Again, this was done by first identifying
all residues within 4.5 Å of a CDR residue, and that
also had the likelihood of interaction, based on
orientation, hydrophobicity, etc. All residues on the light
or heavy/light hybrid chain within 4.5 Å of a heavy/light
hybrid chain CDR residue were examined for the possibility
of significant interaction with the CDR residue of interest.
In this way, the entire set of light and heavy chain
CDR-associated residues was determined. The set of
CDR-associated residues determined for the humanized Fv was
compared with that determined for the donor Fv. In any case
where an additional CDR-associated residue was seen for the
humanized model, the amino acid at that position was
replaced by the amino acid found in the murine donor.

After the CDR-associated residues were modified as
necessary, the model was analyzed to determine if the chain
association residues identified for acceptor were conserved.
In this example, they were conserved. If, however,
differences are observed, these are noted, but no changes are
made at this time. If, in addition, there is a significant
decrease in expression observed for the humanized molecule,
these are potential sites for modification.

As can be seen from the results presented in the 
Examples below, these modeling methods yield high affinity 
CSV L recombinant antibodies from an initial design without a 
requirement for iteration. 

This method of modeling CDR-switched antibodies
using structurally conserved regions can readily be modified
by one skilled in the art to produce the CSV L recombinant
antibodies of this invention, such as heavybodies or CSV L
fragments.

The acceptor amino acids identified as candidates
for switching to donor amino acids by molecular modeling can
be switched by oligonucleotide directed or site-directed
mutagenesis of the DNA sequences encoding the CDR-grafted
heavy and light variable regions, for instance, as taught by
T. Kunkel, Proc. Natl. Acad. Sci. USA, 82:488-492 (1985) or
by codon-based mutagenesis whereby an amino acid alteration
is obtained for each in vitro substitution of a three
nucleotide codon (Huse, et al., Science, 246:1275 (1989)).
Preferably, however, the DNA of the entire variable region of
the heavy and light chains is prepared by oligonucleotide
synthesis as described hereafter.

Once the DNA encoding a CSV L recombinant antibody
has been prepared, it is then incorporated into a vector and
operably linked to nucleic acid sequences encoding
transcriptional and translational regulatory sequences. Any
suitable expression vector may be used in this invention and
exemplary vectors are provided in the Examples below. Those
with skill in the art will appreciate that the choice of
vector is limited to those vectors capable of directing
expression of the nucleic acid sequence and
to those vectors that can incorporate and support the
function of the regulatory regions used. Further, the choice
of vector is limited by the cell type selected. Not all
vectors and not all regulatory elements necessary for
recombinant protein expression function in all cell types.
As a general rule eukaryotic expression vectors are suitable
for protein expression in eukaryotes and prokaryotic
expression vectors are suitable for prokaryotes. Both types
of vectors are commercially available and those with skill in
the art of molecular biology will be able to select
appropriate vectors suitable for recombinant protein
expression within a given cell type.

Methods for incorporating a particular region of
nucleic acid into a nucleic acid vector are well known in the
art of molecular biology (see Sambrook, et al., Molecular
Cloning: A Laboratory Manual, Second Edition, Cold Spring
Harbor Laboratory Press, 1989). For example, short regions
of nucleic acid (less than 400 bp) can be prepared by
synthesizing sense and antisense oligonucleotides
complementary to the desired gene sequence that overlap.
These oligonucleotides hybridize to one another, and can be
amplified by polymerase chain reaction, ligated and
incorporated into an expression vector (see PCR Technology:
Principles and Applications for DNA Amplification, W.H.
Freeman and Co., New York, 1992).
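
The design of overlapping sense and antisense oligonucleotides can be sketched as follows. This is a simplified illustration, not a synthesis protocol: the oligonucleotide length of 60 and overlap of 20 bases are arbitrary assumptions, no account is taken of melting temperature or secondary structure, and the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def overlapping_oligos(gene, oligo_len=60, overlap=20):
    """Tile `gene` with alternating sense and antisense oligonucleotides.

    Consecutive windows along the gene share `overlap` bases, so each
    antisense oligonucleotide can hybridize to the ends of its two sense
    neighbors. Every oligonucleotide is returned written 5' to 3'.
    """
    if not 0 < overlap < oligo_len:
        raise ValueError("overlap must be positive and shorter than oligo_len")
    gene = gene.upper()
    step = oligo_len - overlap
    oligos = []
    starts = range(0, max(len(gene) - overlap, 1), step)
    for n, start in enumerate(starts):
        piece = gene[start:start + oligo_len]
        if n % 2 == 0:
            oligos.append(("sense", piece))
        else:
            oligos.append(("antisense", reverse_complement(piece)))
    return oligos
```

For a region of less than 400 bp, the defaults give fewer than ten oligonucleotides.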

In general, the recombinant antibodies of this
invention can be prepared by recombinant methods known in the
art (see generally, Sambrook, et al., supra) from the amino
acid and DNA sequences of the donor and acceptor antibodies.
If a monoclonal antibody is used as the donor
antibody, hybridoma or polydoma technology using conventional
procedures for immunization of mammals with an immunogenic
antigen preparation, fusion of immune lymph or spleen cells
with an immortal myeloma cell line, and isolation of specific
hybridoma clones can be employed to obtain the monoclonal
antibody.

Alternatively, the genes encoding the donor and
acceptor antibodies can be obtained by methods known in the
art, for instance by chemical synthesis, as described above,
if the sequences of the genes are known. If the sequences
are not known, or if the genes have not previously been
isolated, they may be cloned from a cDNA library (made from
RNA obtained from a suitable tissue or batch of cells in
which the desired gene is expressed, such as a hybridoma or
polydoma) or from a suitable genomic DNA library. The mRNA
is extracted and cDNA for the coding regions is derived using
the enzyme reverse transcriptase and methods well known in
the art. The gene is then identified using an appropriate
molecular probe. For cDNA libraries, suitable probes include
monoclonal or polyclonal antibodies (provided that the cDNA
library is an expression library), oligonucleotides, and
cDNAs or fragments thereof. The probes that may be used to
isolate the gene of interest from genomic DNA libraries
include cDNAs or fragments thereof that encode the same or a
similar gene, homologous genomic DNAs or DNA fragments, and
oligonucleotides. Screening the cDNA or genomic library with
the selected probe is conducted using standard procedures as
described in chapters 10-12 of Sambrook, et al. (supra).

From the sequence of the cDNA or that of the
genomic DNA, the corresponding amino acid sequences to be
used in molecular modeling are deduced, usually by a computer
software program, such as is commercially available from
DNAStar (Madison, WI). Once the amino acid sequences of the
donor and acceptor antibodies are known, their CDRs are
identified using the procedure of Kabat and Wu, supra. For
modeling and construction of a CSV L domain, the amino acids
corresponding to at least one and preferably all three CDRs
of the acceptor VL are replaced with CDRs of the donor VH.
Additional donor residues identified by molecular modeling as
useful for retaining binding affinity and/or chain
association are determined as described above.

Once the nucleotide sequence capable of encoding the
CDR-grafted CSV L domain has been determined from the protein
sequence, it is fabricated and ligated into a suitable
replicable expression vector, optionally along with the
desired constant region genes from the acceptor antibody. A
similar procedure is then followed to construct the vector
containing the genes encoding the associated CDR-grafted
light chain or heavy chain, if applicable, using methods well
known in the art.

It is preferred that the DNA encoding the entire
CDR-grafted variable region, including the CSV L domain(s), be
inserted into an appropriate sequencing vector (e.g., a TA
vector) and sequenced employing, for instance, the
Sequenase kit (United States Biochemical, Cleveland, OH)
used with a Genesis® 2000 automated DNA sequencer (Dupont,
Wilmington, DE) according to the manufacturer's instructions.
The spliced and sequenced exon is then excised from the
sequencing vector and ligated into a vector that may
optionally contain one or more exons encoding constant
regions for the CDR-grafted chain. If it is desired to
produce a recombinant antibody having a light and a heavy
chain, the DNA encoding the light chain can be spliced into
one vector and the DNA encoding the heavy chain can be
spliced into another vector. Alternatively, the DNA encoding
both chains can be spliced into the same vector.

To obtain the recombinant antibodies of the
invention, the DNA encoding one or more immunoglobulin chains
prepared as described above is ligated into a replicable
expression vector so as to be operably linked to
transcription regulatory element(s); suitable host cells are
transfected with the vectors; and the transformed host cells
are cultured under conditions favorable for forming the
desired recombinant antibodies.

Various types of vectors may be used, such as
plasmids and viruses, including animal viruses and
bacteriophages. In one embodiment, a vector is employed
which is capable of integrating the desired gene sequences
into the host cell chromosome. The cells which have stably
integrated the introduced DNA into their chromosomes can be
selected by also introducing one or more marker genes which
allow for selection (e.g., growth of the cells in the
presence of a toxic drug) of host cells which contain the
expression vector. The introduced marker gene sequence will
be incorporated into the plasmid or viral vector containing
the gene(s) encoding the construct containing a CSV L domain.
Factors of importance in selecting a plasmid or viral
expression vector include the ease with which recipient cells
that contain the vector may be recognized and selected; the
number of copies of the vector which can be introduced or
desired in a particular host; and whether it is desirable to
"shuttle" the vector between host cells of different species.

Eukaryotic expression vectors for yeast or 
mammalian cells, as well as prokaryotic expression vectors, 
may be used to express the recombinant antibodies of this
invention.

Although either eukaryotes or prokaryotes can be
used as host cells for this invention, the modeling methods
used are exceptionally appropriate for eukaryotic cells, and
more specifically for mammalian B lymphocytes.
Alternatively, however, expression can be obtained in a
multitude of species, using suitable vectors and hosts.
Suitable prokaryotic host cells include E. coli strain JM101,
E. coli K12 strain 294 (ATCC No. 31,336), E. coli strain
W3110 (ATCC No. 27,325), E. coli X1776 (ATCC No. 31,537),
E. coli XL-1-Blue (Stratagene), and E. coli B; however, many
other strains of E. coli, such as HB101, NM522, NM538, NM539,
and many other species and genera of prokaryotes may be used
as well. In addition to the E. coli strains listed above,
bacilli such as Bacillus subtilis, other enterobacteriaceae
such as Salmonella typhimurium or Serratia marcescens, and
various Pseudomonas species may all be used as hosts. As is
well known to one skilled in the art, it is necessary to
remove any introns from eukaryotic genes which are to be
expressed in prokaryotic hosts.

When the vector is designated for expression in
baculovirus, suitable promoters and enhancer sequences
include, but are not limited to, AcMNPV polyhedrin, AcMNPV ETL
and AcMNPV p10 sequences. One particularly suitable
polyadenylation signal is the polyhedrin AcMNPV. Ig Kappa,
Ig Heavy and AcMNPV are examples of suitable signal
sequences. These vectors are useful in the following insect
cell lines, among others: SF9, SF21 and High 5.

Alternatively, the polypeptides can be expressed in
yeast strains such as PS23-6A, W301-18A, LL20, D234-3,
INVSC1, INVSC2, YJJ337. Promoter and enhancer sequences such
as pEFT-1 are useful. Vra-4 also provides a
suitable enhancer sequence. Sequences useful as functional
"origins of replication" include ars1 and 2µ circular
plasmid.

Following procedures outlined above, mammalian cell
lines such as myeloma (P3-653) or hybridoma (SP2/0), Chinese
Hamster Ovary (CHO), Green monkey kidney (COS1) and murine
fibroblasts (L492) are suitable host cells for expression.
These "mammalian" vectors can include a promoter, an
enhancer, a polyadenylation signal, signal sequences and
genes encoding selectable markers such as, but not limited
to, geneticin (neomycin resistance), mycophenolic acid
(xanthine guanine phosphoribosyl transferase) or histidinol
(histidinol dehydrogenase).

Suitable promoters for use in mammalian host cells
include, but are not limited to, Ig Kappa, Ig Heavy,
cytomegalovirus (CMV) immediate early, Rous Sarcoma virus
(RSV), Simian virus 40 (SV40) early, mouse mammary tumor
virus (MMTV) and metallothionein. Suitable enhancers
include, but are not limited to, Ig Kappa, Ig Heavy, CMV early
and SV40. Suitable polyadenylation sequences include Ig
Kappa, Ig Gamma or SV40 large T antigen. Suitable signal
sequences include, but are not limited to, Ig Kappa, Ig Heavy
and human growth hormone (HGH).

For expression in mammalian cells the vectors
containing the DNA encoding the heavy and light chain genes
of the antibody construct can be placed into separate
bacterial amplification vectors, such as E. coli DH10B
ElectroMAX (BRL, Gaithersburg, MD), cultured, and screened
for antibiotic resistance to amplify the plasmid. Generally,
the DNA of the selected clones is verified by restriction
digestion and DNA sequencing. Double stranded dideoxy
sequencing is performed, for example, on a DuPont Genesis®
2000 instrument, using the DuPont Genesis® 2000 sequencing
kit according to the manufacturer's instructions. Post gel
processing can be done with the Base Caller 5.0 program
(DuPont, Boston, MA). One skilled in the art can readily
provide alternative methods of performing these steps in the
cloning process.

Particularly useful vectors for expression of the
CSV L recombinant antibodies of this invention in mammalian
cells are pGIM9 kappa and pNIM9k/hCEM-gamma deposited with the
ATCC under the requirements of the Budapest Treaty under
Accession Nos. 75512 and 75511, respectively. These vectors
comprise human immunoglobulin regulatory elements and contain
cassette sites for insertion of DNA encoding CDR-grafted
light and heavy chain sequences. These vectors, which are
especially designed for expressing CDR-grafted antibodies and
fragments wherein the acceptor antibody is human, are
preferably transfected into host cells of the B-cell lineage
for production of optimal levels of immunoglobulin. Use of
these vectors is exemplified in the examples below. The
principal advantage of expressing the CSV L domain in the
above described vectors in host cells of the B-cell lineage
is that this allows for maximal conservation of assembly and
secretory components to assure reproducible high level
expression and secretion of the molecules of interest.

After selection of the transformed cells, these
cells are grown in culture media and screened for expression
of the appropriate antibody construct using techniques well
known in the art for enzyme or radio assay, or by the methods
exemplified in Example 15 below. Expression of the sequence
results in the production of the fusion protein of the
present invention.

A chelator may also be bound to the CSV L
recombinant antibody through a short or long chain linker
moiety, through one or more functional groups on the
antibody, e.g., amine, carboxyl, phenyl, thiol or hydroxyl
groups. See, for example, Schlom, "Monoclonal Antibody-based
Therapy of a Human Tumor Xenograft with a 177Lutetium-labeled
Immunoconjugate," Cancer Res. 51:2889-2896 (1991); U.S.
Patent 4,994,560 to Kruper, et al.; and Sigel, et al.,
"Coordinating Properties of the Amide Bond. Stability and
Structure of Metal Ion Complexes of Peptides and Related
Ligands," Chemical Reviews, 82:385-426 (1982). Various
conventional linkers can be used, e.g., diisocyanates,
diisothiocyanates, carbodiimides, bis-hydroxysuccinimide
esters, maleimide-hydroxysuccinimide esters, glutaraldehyde
and the like, for instance, a selective sequential linker
such as the anhydride-isothiocyanate linker disclosed in U.S.
Patent 4,680,338.

This invention also contemplates fusing at least
one of the genes encoding the CSV L recombinant antibodies to
a second gene encoding a chelating peptide for binding a
radiometal ion, a toxin, or an enzyme such that a fusion
protein is generated during transcription and translation.
Fusion of two genes may be accomplished by inserting the gene
encoding the chelating peptide into a particular site on a
plasmid that contains an antibody gene, preferably a constant
region gene, or by inserting an antibody gene into a
particular site on a plasmid that contains a gene encoding
the chelating peptide.

The plasmid is cut at the precise location that the
gene is to be inserted using a restriction endonuclease site
(preferably a unique site). The plasmid is digested,
phosphatased and purified as described above. The gene
encoding the second protein or protein segment is then
inserted into this linearized plasmid by ligating the two
DNAs together such that the reading frames of the gene
already in the plasmid and of the gene to be inserted are
preserved. If the two pieces of DNA to be ligated have blunt
ends or sticky ends, ligation can be direct using a ligase
such as bacteriophage T4 DNA ligase and incubating the
mixture at 16°C for 1-4 hours or overnight in the presence of
ATP and ligase buffer as described in Section 1.68 of
Sambrook, et al., supra. If the ends are not compatible,
they must first be made blunt by using the Klenow fragment of
DNA polymerase I or bacteriophage T4 DNA polymerase, both of
which require the four deoxyribonucleotide triphosphates to
fill in overhanging single-stranded ends of the digested DNA.

When constructing a replicable expression vector
containing the DNA encoding one or more of the chains of the
instant CSV L recombinant antibodies, all subunits can be
regulated by the same promoter, typically located 5' to the
DNA encoding the subunits, or each can be regulated by a
separate promoter suitably oriented in the vector so that
each promoter is operably linked to the DNA it is intended to
regulate. When the CSV L DNA is composed of subunits, for
example, the DNA for the heavy and light chains of an intact
kappabody, generally one of the subunits is fused or operably
linked to the gene for the chelating peptide, if one is
included. This fused gene will contain a functional signal
sequence. A separate gene encodes the other subunit or
subunits, and each subunit generally has its own signal
sequence. Alternatively, to increase the specific activity
of the gene fusion product, more than one gene for the
chelating peptide can be fused to a subunit. For example,
the gene for the chelating peptide can be fused to the genes
encoding both the heavy and light chains of any antibody or
antibody fragment, such as an intact kappabody or a heavybody
or Fab-like fragment. A single promoter can regulate the
expression of both subunits, or each subunit can be
independently regulated by a different promoter. Thus,
generally the complementary chain needed to provide the
binding domain of the protein ligand may be provided by
expressing the complementary chain as a single polypeptide in
the host cell, or such a single polypeptide can be added
separately. For example, to produce a fusion protein
composed of a chelating peptide and a kappabody fragment, a
gene encoding a light chain (or portion thereof) is
functionally linked to the chelating peptide gene and this
hybrid gene is expressed in a host cell. To allow formation
of the binding domain or double chain fragment (e.g.,
kappabody fragment or ScFv (CSV L)), the same host cell can be
engineered to express the other chain and excrete the
assembled fragment having the chelating peptide attached to
the corresponding light chain. In another embodiment, the
chelating peptide can be attached to the light chain and
expressed alone as a fusion protein (such as with a CSV L or
heavybody fragment), or both chains can be attached to
chelating peptides as fusion proteins and the dimer construct
can be expressed from a single host cell.

The molecules of this invention can be used in all
in vitro diagnostic, in vivo diagnostic, and therapeutic
applications for which antibodies have been used or their use
proposed. These include naked antibody therapy (both those
requiring effector function and those only requiring binding
function), radioimmunotherapy, in vivo
radioimmunodiagnostics, in vitro radioimmunometric assays,
ELISA assays, quantitative ELISA assays, and
immunohistochemical applications.

The scintigraphic imaging method of the invention
is practiced by injecting a warm-blooded animal, preferably a
mammal, and more preferably a human, parenterally with an
effective amount for scintigraphic imaging of the
radiolabeled monospecific or multispecific antibody agent
conjugate. By parenterally is meant, e.g., intravenously,
intraarterially, intrathecally, interstitially or
intracavitarily. For imaging cardiovascular lesions,
intravenous or intraarterial administration is preferred.

Labeling with either Iodine-131 or Iodine-123 is
readily effected using an oxidative procedure wherein a
mixture of radioactive potassium or sodium iodide and the
antibody is treated with chloramine-T, e.g., as reported by
Greenwood, et al., Biochem. J., 89:114 (1963) and modified by
McConahey, et al., Int. Arch. Allergy Appl. Immunol., 29:185
(1969). This results in direct substitution of iodine atoms
for hydrogen atoms on the antibody molecule. Alternatively,
lactoperoxidase iodination may be used, as described by
Feteanu, "Labeled Antibodies in Biology and Medicine," page
302 (McGraw-Hill Int. Bk. Co., New York, 1978), and
references cited therein.

Feteanu also discloses a wide range of more
advanced labeling techniques, supra, pages 214-309.
Introduction of various metal radio-isotopes may be
accomplished according to the procedures of Wagner, et al.,
J. Nucl. Med., 20:428 (1979); Sundberg, et al., J. Med.
Chem., 17:1304 (1974); and Saha, et al., J. Nucl. Med., 6:542
(1976), for instance.

As used in the methods of the present invention,
the compounds taught herein can be administered to the
subject animal, such as a laboratory animal, a mammal or more
preferably a human, by any means known to those skilled in
the art, including parenteral injection or topical
application. Injection can be done intravascularly,
intraperitoneally, subcutaneously or intramuscularly. For
parenteral administration, the compounds can be administered
in admixture with a suitable pharmaceutically acceptable
carrier. As used herein the term "pharmaceutically
acceptable carrier" encompasses any of the standard
pharmaceutical carriers, such as a phosphate buffered saline
solution, water, and emulsions, such as an oil/water or
water/oil emulsion, and various types of wetting agents.

This invention also provides pharmaceutical
compositions containing any of the CSV L recombinant
antibodies fused to the metal chelating peptides described
herein linked to protein ligands, with or without the
radioion having been incorporated into the chelating peptide.

Therapeutic formulations of the compositions of
this invention are prepared for storage by mixing the metal
chelate-protein complex with optional physiologically
acceptable buffers and carriers, excipients, or stabilizers
(Remington's Pharmaceutical Sciences, 16th edition, Osol, A.,
Ed. (1980)), in the form of lyophilized cake or aqueous
solutions. Acceptable carriers, excipients or stabilizers
are nontoxic to recipients at the dosages and concentrations
employed, and include buffers such as phosphate, citrate and
other organic acids; antioxidants including ascorbic acid;
low molecular weight (less than about 10 residues)
polypeptides; proteins, such as serum albumin, gelatin, or
immunoglobulins; hydrophilic polymers such as
polyvinylpyrrolidone; and the like. These pharmaceutical
compositions are used for in vivo diagnostic or therapeutic
purposes.

The recombinant antibodies of this invention are
present in the pharmaceutical composition in an effective
amount. Methods of determining effective amounts are known
to those of skill in the art and depend upon a variety of
factors, including the type of disorder, age, weight, sex and
medical condition of the animal or human patient, the
severity of the condition, the route of administration, and
the type of diagnostic or therapeutic treatment desired. A
skilled veterinarian or physician can readily determine and
prescribe the effective amount of the compound or
pharmaceutical composition required to diagnose or treat the
animal or patient, respectively. Therefore, the dose of the
diagnostic compound would be selected to accommodate this
requirement. For diagnostic applications a typical radiodose
is between 20 and 30 mCi. For instance, if the CSV L
recombinant antibody is an Fab-kappabody fragment the dosage
is generally in the range between about 1 and 3.0 mCi per nmol
of fragment. As one skilled in the art will appreciate, the
amount and type of CSV L recombinant antibodies used will
affect the pharmacokinetics of the compound and one skilled
in the art would take these considerations into account in
selecting the proper compound and dosage in use.
Conventionally, for therapeutic application, one skilled in
the art would employ relatively low doses initially and
subsequently increase the dose until a maximum safe response
is obtained. The specific activity of the compound will
determine the amount of the compound administered and hence,
the dosage of the compound containing the radioion
administered.

For human therapeutic regimens the typical dosage
of the radioion per injection is in the range from about 10
to 30 mCi per injection and the typical corresponding antibody
dose is in the range from about 2 to 10 mg. Although in
certain instances a single therapeutic dose can be effective,
more typically the patient to be treated will be administered
a series of gradually increasing doses at intervals spaced
appropriately to accommodate the needs of the patient. For
instance, when the CSV L recombinant antibody is a kappabody
fragment, is tumor-specific, and is fused to a chelating
peptide incorporating Yttrium-90 as the therapeutic radioion,
a typical dosage regimen would consist of repeated
administration of the therapeutic compound over appropriately
spaced intervals, for instance of two weeks duration,
beginning with a dosage of 10 mCi/2 mg of antibody and
increasing to a dosage of about 30 mCi/10 mg of antibody. If
the CSV L recombinant antibody is incorporated into a compound
containing a separate chelating peptide, the weight of the
chelating peptide is negligible in comparison to the weight
of the antibody so that its weight can be ignored in
calculating the proper ratio of radionuclide to delivery
agent (i.e., chelating peptide plus antibody).

Alternatively, paramagnetic compounds useful for
MRI image enhancement can be conjugated to a substrate
bearing paramagnetic ion chelators or exposed chelating
functional groups, e.g., SH, NH2, COOH, for the ions, or
linkers for the radical addends. The foregoing are merely
illustrative of the many methods of radiolabeling proteins
known to the art. The MRI enhancing agent must be present in
sufficient amounts to enable detection by an external camera,
using magnetic field strengths which are reasonably
attainable and compatible with patient safety and
instrumental design. The requirements for such agents are
well known in the art for those agents which have their
effect upon water molecules in the medium, and are disclosed
inter alia in, e.g., Pykett, Scientific American, 246:78
(1982); and Runge, et al., Am. J. Radiol., 141:1209 (1987).

The following examples illustrate the manner in
which the invention can be practiced. It is understood,
however, that the examples are for the purpose of
illustration and the invention is not to be regarded as
limited to any of the specific materials or conditions
therein.

Example 1

PCR Cloning of ZCE 025 Variable Regions

The initial cDNA cloning of the ZCE 025 variable
regions using the method of Okayama, H. and Berg, P. (Mol.
and Cell. Biol., 2:161-170 (1982); Mol. and Cell. Biol.,
3:280-289 (1983)) gave 3' sequences for both the heavy and
light chains. In order to obtain the 5' sequences, the
variable regions were isolated using a method termed "anchor
PCR" (Loh, E.Y. et al., Science, 243, 217-220 (1989)). Anchor
PCR allows the use of a specific heavy or light chain primer
(in our case, a sequence in the CK or CH1 regions) and a
second poly-C-containing primer that recognizes a poly-G
sequence added to all the mRNA-derived cDNAs, as is shown in
Table 1 below. Another advantage of this technique is that
the upstream primer recognizes an added synthetic segment of
DNA, making it possible to obtain the native sequence of the
entire signal region.



Table 1

poly C primer              C region primer     poly A tail

CCCCCCC                    XXXXXXX
GGGGGGG ------------------ XXXXXXX ----------- TTTTTTT

                region amplified

a. Cloning of ZCE Kappa Light Chain cDNA

(1) ZCE 025 mRNA was obtained using the Guanidinium HCl
procedure, as described in Sambrook, et al. (supra, 7.18-
7.22).

(2) The first and second strand cDNA syntheses were
performed using the Stratagene (San Diego, CA) LambdaZap®
cDNA cloning kit according to the manufacturer's directions
without the incorporation of a radioactive nucleotide. The
resulting cDNA was ethanol precipitated.

(3) A poly G tail was added to the 3' ends of the cDNA
by resuspending the precipitated cDNA in 23 µl water and
adding 10 µl 5X tailing salts (0.9 M Sodium Cacodylate, 150 mM
Tris-HCl (pH 6.8)), 5 µl 1 mM Dithiothreitol, 5 µl 10 mM dGTP,
5 µl 10 mM Cobalt Chloride, 2 µl (40 Units) terminal
deoxynucleotide transferase (Boehringer Mannheim,
Indianapolis, IN) and incubating for 1 hour at 37°C.
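
As a quick arithmetic check of the tailing reaction in step (3), the listed volumes can be summed and the final concentrations computed by simple dilution. This is only a sketch: the volumes and stock concentrations are read from the step above (the 1 mM dithiothreitol stock is the reading assumed here), and the variable and function names are illustrative.

```python
# Volumes (in microliters) of each component listed in step (3).
volumes_ul = {
    "cDNA in water": 23,
    "5X tailing salts": 10,
    "1 mM dithiothreitol": 5,   # stock concentration as read from step (3)
    "10 mM dGTP": 5,
    "10 mM cobalt chloride": 5,
    "terminal transferase": 2,
}

total_ul = sum(volumes_ul.values())


def final_concentration(stock, volume_ul):
    """Dilution: final = stock * (component volume / total reaction volume)."""
    return stock * volume_ul / total_ul


salts_x = final_concentration(5, volumes_ul["5X tailing salts"])       # in X
dtt_mm = final_concentration(1, volumes_ul["1 mM dithiothreitol"])     # in mM
dgtp_mm = final_concentration(10, volumes_ul["10 mM dGTP"])            # in mM
cobalt_mm = final_concentration(10, volumes_ul["10 mM cobalt chloride"])  # in mM

print(total_ul, salts_x, dtt_mm, dgtp_mm, cobalt_mm)
```

The components add up to a 50 µl reaction, so the 5X tailing salts end up at 1X, which is consistent with the recipe as written.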

(4) The poly G tailed cDNA was digested with Xho I.
This enzyme cleaves the cDNA at an Xho I site within the
Stratagene primer specific to the poly A region of the mRNA
used for cDNA synthesis and removes the downstream poly G
tail on the second strand of the cDNA.

(5) The ZCE 025 Kappa V region was isolated from the
cDNA using the GeneAmp® PCR kit from Perkin Elmer Cetus
(Norwalk, CT) according to the manufacturer's instructions.
The poly G-tailed, Xho I-cut cDNA was used as template with
the following poly C upstream primer:

5'GAC TAG CGG CCG CAT CGA TCC CCC CCC CCC CCC C (SEQ. I.D.
No. 3) and a murine Kappa-specific downstream primer:
5'CAG ACG TCG ACG ATG GAT ACA GTT GGT GCA GCA TC (SEQ. I.D.
No. 4). The amplification conditions were 94°C for 1 min, 45°C
for 1 min, 72°C for 3 min for 25 cycles.

(6) The amplified DNA was digested with Sal I and Not I
and ligated into the pBluescript® cloning vector from
Stratagene, which vector had been previously digested with Sal
I and Not I. The ligated mixture was used to transform
freshly prepared competent cells of the E. coli strain MC1061
(Clontech, Palo Alto, CA). The bacterial cells thus
transformed were identified by ampicillin resistance.
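
The Sal I / Not I digestion in step (6) relies on recognition sites carried in the 5' tails of the two PCR primers. The sketch below checks this by searching the primer sequences for the standard Not I (GCGGCCGC) and Sal I (GTCGAC) recognition sequences. The primer strings are copied from this example, with the downstream primers read as beginning CAG ACG TCG ACG; the dictionary and function names are illustrative only.

```python
# Standard recognition sequences of the two enzymes used in step (6).
RECOGNITION_SITES = {"NotI": "GCGGCCGC", "SalI": "GTCGAC"}

# Primer sequences as given in this example (5' to 3').
PRIMERS = {
    "poly C upstream": "GAC TAG CGG CCG CAT CGA TCC CCC CCC CCC CCC C",
    "kappa downstream": "CAG ACG TCG ACG ATG GAT ACA GTT GGT GCA GCA TC",
    "gamma 1 downstream": "CAG ACG TCG ACG TTC CAG GTC ACT GTC ACT GGC TC",
}


def find_sites(primer):
    """Return {enzyme: 0-based position} for each recognition site in primer."""
    seq = primer.replace(" ", "").upper()
    return {
        enzyme: seq.find(site)
        for enzyme, site in RECOGNITION_SITES.items()
        if site in seq
    }


site_map = {name: find_sites(seq) for name, seq in PRIMERS.items()}
for name, sites in site_map.items():
    print(name, sites)
```

Each primer carries exactly one of the two sites, so the amplified fragment has a Not I site at the upstream end and a Sal I site at the downstream end, which gives directional cloning into the Sal I / Not I cut vector.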

(7) Positive colonies were confirmed by restriction
enzyme analysis and these had inserts of approximately 400
bp, the expected size for the kappa V region.

(8) The positive clones were verified by sequence
analysis on the Genesis® 2000 automated DNA sequencer from
DuPont (Wilmington, DE). The cDNA sequence (SEQ. I.D. No.
5) of the light chain variable region of ZCE 025 obtained and
the corresponding amino acid sequence (SEQ. I.D. No. 6)
are shown below.

SEQ. I.D. No. 5
ZCE-025 Light Chain Variable cDNA
GAC ATT GTG ATG ACC CAG TCT CAA AAA TTT ATG TCC ACA TCA GTT GGA 
GAC AGG GTC AAC ATC ACC TGC AAG GCC AGT CAG AAT GTT CGT ACT GCT 
GTA GCC TGG TAT CAA CAG AAA CCA GGG CAG TCT CCT AAA GCA CTG ATT 
™C TTG GCA TCC AAC CGG TAC ACT GGA GTC CCT GAT CGC TTC ACA GGC 
ATT GGA TCT GGG ACA GAT TTC ACG CTC ATC ATT AGC AAT GTG CAA TCT 
GAA GAC CTG GCA GAT TAT TTC TGT CTG CAA CAT TGG AAT TAT CCT CTC 
ACG TTC GGT GCT GGG ACC AAG CTG GAG CTG AAA C 
381 
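
The cDNA listing above can be checked against the amino acid listing by translating it codon by codon with the standard genetic code. The sketch below does this for the first two rows of the listing only, since they contain no ambiguous characters; the function name `translate` is illustrative, and unreadable codons are reported as X rather than guessed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1; unknown codons become 'X'."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First two rows of the ZCE-025 light chain variable cDNA listing.
first_rows = (
    "GAC ATT GTG ATG ACC CAG TCT CAA AAA TTT ATG TCC ACA TCA GTT GGA "
    "GAC AGG GTC AAC ATC ACC TGC AAG GCC AGT CAG AAT GTT CGT ACT GCT"
)
protein_start = translate(first_rows)
print(protein_start)
```

The translated residues agree with the start of the light chain amino acid sequence, which makes this a convenient way to spot transcription errors in either listing.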

SEQ. I.D. No. 6
Murine ZCE-025 Light Chain Variable Region Amino Acid
Sequence:

DIVMTQSQKFMSTSVGDRVNITCKASQNVRTAV
PTGlGSGTDFTLIISNVQSEDLADYPCLQHWNYPLTFGAGTKLELK

b. Cloning of ZCE Gamma cDNA

(1) ZCE 025 mRNA was obtained using the Guanidinium HCl
procedure, as described in Sambrook, et al. (supra, 7.18-
7.22).

(2) cDNA was prepared using the method described in
Example 1.a., above, for the ZCE kappa light chain.

(3) A poly G tail was added to the 3' ends of the cDNA
as described in Example 1.a., above.

(4) The poly G tailed cDNA was digested with Xho I and
the ZCE 025 Gamma variable region was isolated from the cDNA
using the GeneAmp® PCR kit from Perkin Elmer Cetus according
to the manufacturer's instructions. The poly G-tailed, Xho I-
cut cDNA was used as template with the following poly C
upstream primer:

5'GACTAGCGGCCGCATCGATCCCCCCCCCCCCCCC (SEQ. I.D. No. 3)
and a murine Gamma 1 specific downstream primer:
5'CAG ACG TCG ACG TTC CAG GTC ACT GTC ACT GGC TC (SEQ. I.D.
No. 7). The amplification conditions were 94°C for 1 min, 45°C
for 1 min, 72°C for 3 min for 40 cycles.

(6) The amplified DNA was digested with Sal I and Not I
and ligated into the pBluescript® cloning vector (Stratagene,
San Diego, CA) which had been previously digested with Sal I
and Not I. The ligated mixture was used to transform freshly
prepared competent cells of the E. coli strain MC1061. The
bacterial cells thus transformed were identified by
ampicillin resistance.

(7) Positive colonies were confirmed by restriction 
enzyme analysis and these had inserts of approximately 450 
bp, the expected size for the Gamma chain variable region. 

(8) The positive clones were verified by sequence
analysis on the Genesis® 2000 automated DNA sequencer from
DuPont (Wilmington, DE), according to the manufacturer's
instructions. The cDNA sequence (SEQ. I.D. No. 8) of the
ZCE heavy chain variable region obtained and the
corresponding amino acid sequence (SEQ. I.D. No. 9)
are shown below.

SEQ. I.D. No. 8

ZCE-025 Heavy Chain Variable cDNA Sequence:

GAT GTG CAG CTG GTG GAG TCT GGG GGA GGC TTA GTG CCG CCT GGA GGG 
TCC CGG AAA CTC TCC TGT GCA GCC TCT GGA TTC ACT TTC ACT AAC TTT 
GGA ATG CAC TGG ATT CGT CAG GCT CCA GAG AAG GGA CTG GAG TGG GTC 
GCA TAC ATT AGT GOT GGC ACT AGT ACC GTC CAC TAT GCA GAC TCC TTG 
AAG GGC CGA TTC ACC ATC TCC AGA GAC AAT CCC AAG AAC ACC CTG TTC 
CTA CAA ATG ACC AGT CTA AGG TCT GAA GAC ACG GCC ATG TAT TAC TGT 
GCA AGA GAT TAC TAC GTT AAT AAC TAC TGG TAC TTC GAT GTC TGG GGC 
GCA GGG ACC ACG GTC ACC GTC TCC TCA G 
420 

SEQ. I.D. No. 9

Murine ZCE-025 Heavy Chain Variable Region Amino Acid
Sequence:

WQLVESGGGLVPPGGSRKLSCAASGFTFSl^GMHWIRQAPEKGLEWAYISGGSSTVHYA 
DSLKGRFTISRDNPKNTLFI^ 


(a oBiag-nnfl flnm i rnmlT ir i no Vitrnr h«» TO „„ n1n r m m 

The human plasmacytoma cell line IM9 (ATCC #159)
expresses an IgG (γ1, κ) immunoglobulin.

a. Extraction of IM9 mRNA.

A total of 8 x 10^7 IM9 cells were used for mRNA
purification by the Fast-Track® kit from Invitrogen (San
Diego, California) using an enzyme mix to digest the cells
and oligo dT resin to adsorb the polyadenylated mRNA from the
cell lysate according to manufacturer's directions. The
resulting mRNA was redissolved in 100 µl of sterile water and
split into 10 µl aliquots. Each aliquot was stored at -20°C in
ammonium acetate and ethanol.

b. Synthesis of cDNA.

The synthesis of a cDNA library was performed using
a Librarian kit (Invitrogen). The pooled mRNA from four of
the tubes in a. was quantitated by measuring absorbance at
260 nm. The first strand cDNA synthesis was performed
according to manufacturer's directions using an oligo-dT
primer and reverse transcriptase in the presence of
deoxyribonucleotides and RNAase inhibitors. Second strand
synthesis was begun immediately by addition of ribonuclease
H, E. coli ligase, and DNA polymerase in the presence of the
appropriate buffer. The reaction was extracted once with
phenol/chloroform and precipitated. The pellet was
resuspended in sterile water and ligated with BstXI linkers
supplied with the kit.

c. Purification of cDNAs.

The products of cDNA synthesis and linker ligation
were separated by size on an agarose gel in TAE (tris acetate
EDTA) buffer (see Sambrook, et al., supra). The cDNA
molecules over 700 bp were cut out of the gel and separated
from the agarose by electroelution into a small volume of TAE
buffer (0.04 M Tris-acetate, 0.001 M EDTA). The cDNA was
extracted once with phenol/chloroform and precipitated. The
sample was centrifuged, and the pellet was rinsed with
ethanol, then air-dried.

d. Vector construction and transformation of
bacterial cells.

The purified cDNA was ligated to the vector
provided in the kit, pcDNAII, which is already cut with an
enzyme that leaves the appropriate sticky ends for the linker
used on the cDNA and not for religation to itself. The
ligation mixture was electroporated into the E. coli strain
DH10B (ElectroMAX) (BRL, Gaithersburg, Maryland) using the
Cell-Porator (BRL) at 330 µF, 2.5 kV. The total number of
colonies obtained in this library was 1.8 x 10^6 clones.

e. Preparation of Filter lifts of the IM9 cDNA
library.

The library was inoculated onto LB agar media
(950 ml deionized water, bacto-tryptone 10 g, bacto-yeast
extract 5 g, NaCl 10 g) with ampicillin at 7500 cfu (colony
forming units) per 15 cm plate. A total of 12 plates were
made for a total of 9 x 10^4 cfu of cDNA clones. The colonies
were blotted onto nylon filters by placing a dry filter on
the colonies and removing the filter. The plates were
returned to the incubator to allow the bacteria to grow back.
The filters were placed on a layer of Whatman filter paper
saturated with 5% SDS, 2 x SSC and put into the microwave
oven on a high setting for 10 minutes. The filters were air-
dried and stored at 4°C.
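
The plating figures in sections d. and e. can be cross-checked with a few lines of arithmetic. This is only a sketch using the numbers stated above (12 plates at 7500 cfu each, and a library of 1.8 x 10^6 clones); the variable names are illustrative.

```python
# Figures taken from sections d. and e. above.
plates = 12
cfu_per_plate = 7500
library_size = 1_800_000  # total clones obtained in the library

colonies_screened = plates * cfu_per_plate
fraction_of_library = colonies_screened / library_size

print(colonies_screened)               # total cfu plated for the filter lifts
print(f"{fraction_of_library:.1%}")    # share of the library that was plated
```

Twelve plates at 7500 cfu give 9 x 10^4 colonies, which corresponds to plating about one twentieth of the library for the primary screen.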

f. Primary Screening of the IM9 cDNA library.

The filters were incubated at 45°C in
prehybridization buffer (2 x SSC, 1% SDS, 0.5% nonfat dry milk).
These were then hybridized with human Ig mixed kappa and
gamma constant region probes using a method and probes
described in C.B. Beidler, et al., supra. The probes were
labeled using a Prime-It® kit (BRL), and hybridization was
carried out in 6 x SSC, 1% SDS, 0.5% nonfat dry milk, at 65°C
overnight. The filters were washed
with 6 x SSC, 1% SDS, three times at 65°C, 5 minutes each
time, then with 1 x SSC, 0.1% SDS, three times at 65°C, 20
minutes each time. The filters were put on Kodak XAR-5 X-ray
film at room temperature overnight.

g. Secondary Screening of the IM9 cDNA library.

Sixty-two positive colonies were picked from the
plates and streaked onto LB agar media in duplicate, twelve
to a plate, for two sets of six plates. These were blotted
on nylon filters and hybridized using a method and probes
described in C.B. Beidler, et al., supra. One set was
hybridized with a kappa constant region probe and one with a
gamma constant region probe.

h. Tertiary and Quaternary Screening of the IM9
cDNA library.

The streaks that were positive for the kappa probe
were picked and plated out on LB media with ampicillin.
These plates were blotted as described before, the filters
were hybridized, and the positives were picked. These clones
were subjected to one more round of blotting and
hybridization to prove that the clone was pure. The sequence
is provided below as SEQ. I.D. No. 10 and the amino acid
sequence is provided as SEQ. I.D. No. 11.

SEQ. I.D. No. 10 

GAC ATC CAG ATG ACC CAG TTT CCT TCC ACC CTG TCT GCT TCT GTA GGA 
GAC AGA GTC ACC 60 

ATC ACT TGT CGG GCC AGT CAG AGT ATT AGT GCC TGG TTG GCC TGG TAT 
CAG CAG AAA CCA 120 

GGG AAA GCC CCT AAA CTC CTG ATC TAT AAG GCG TCT AGT TTA GAA AGT 
GGG GTC CCA TCA 180 

AGG TTC AGC GGC AGT GGA TCT GGG ACA GAG TTC ACT CTC ACC ATC ACC 
AGC CTG CAG CCT 240 

GAT GAT TTT GCA ACT TAT TTC TGC CAA CAC TAT AAT CGA CCG TGG ACG 
TTC GGC CAA GGG 300 
ACC AAG GTG GAA ATC AAA GCA 

IM9 Light Protein SEQ. I.D. No. 11 

DIQMTQFPSTLSASVGDRVTITCRASQSISAWLAWYQQKPGKAPKLLIY 
KASSLESGVPSRFSGSGSGTEFTLTITSLQPDDFATYFCQHYNRPWTFGQGTKVEIK 
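
The relationship between the nucleotide listing and the protein listing can be checked by translating the reading frame codon by codon. The following is a minimal sketch, not part of the original disclosure: it assumes the standard genetic code, the function name `translate` is hypothetical, and only the first 60 nucleotides of SEQ. I.D. No. 10 are used.

```python
# Illustrative sketch (not from the original disclosure): translate a DNA
# reading frame using the standard genetic code.
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) in frame 1; '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 60 nucleotides of SEQ. I.D. No. 10 as listed above.
FIRST_60 = (
    "GAC ATC CAG ATG ACC CAG TTT CCT TCC ACC CTG TCT GCT TCT GTA GGA "
    "GAC AGA GTC ACC"
)
print(translate(FIRST_60))
```

Applied to the full listing, the same function gives a quick way to compare a nucleotide sequence against its stated amino acid sequence.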

i. Southern blot and sequence analysis of light 
chain cDNA clones. 

Ten putative kappa light chain clones were raised 
in LB broth with ampicillin. The plasmids were purified by 
the miniprep method of Holmes and Quigley (D.S. Holmes and M. 
Quigley, Analytical Biochemistry, 114:193, 1981). The 
miniprep DNA was characterized by restriction enzyme mapping 
and Southern blot analysis. The longest of the cDNA inserts 
obtained (clone kappa LI) was 1.2 kb. This clone was 
sequenced on the Genesis 2000 automated DNA sequencer 
(DuPont, Wilmington, Delaware) as described previously. 

j. Rescreening, Southern blot, and sequence analysis 
of heavy chain cDNA clones. 

None of the gamma clones was positive, so the 
library was rescreened as described in f. with the gamma 
constant region as a probe. The positives from this 
screening were picked and rescreened as described above in g. 
and h. until pure cultures were obtained. The putative 
clones were raised and characterized as described in i. and 
two gamma cDNA clones were found. The clones were both 1.6 kb 
in length. The clones were sequenced on the Genesis 2000 
automated DNA sequencer (DuPont) as described previously. 
The sequence is provided below as Sequence I.D. 12 and the 
corresponding amino acid sequence is provided as Sequence 
I.D. 13. 

SEQ. I.D. No. 12 

GAA ATG CAA CTG GTG GAA TTT GGG GGA GGC CTG CTA CAG CCT GGC AGG 
GCC CTG AGA CTC 60 

TCC TGT GCA GCC TCT GGA TTC AGG TTT GAT GAT TAT GCC ATG CAC TGG 
GTC CGG CAA ACT 120 

CCA GGG AAG GGC CTG GAG TGG GTC GCA GGT ATT AGT TGG AAT ACT GAC 
ACC ATA GAC TAT 180 

GCG GAC TCT GTG AAG GGC CGA TTC ACC ATC TCC AGA GAC AAC GCC AAG 
AAC TCC CTC TAT 240 

TTG CAA ATG AAC AGT CTC AGA GCT GAG GAC ACG GCC TTG TAT TAC TGT 
ACA AAA AGA AGG 300 

GGG GTG ACA GAC ATT GAC CCT TTT GAT ATC TGG GGC CAA GGG ACA ATG 
GTC ATC GTC TCT 360 
TCA GAG 366 

IM9 Heavy Protein SEQ. I.D. No. 13 

EMQLVEFGGGLLQPGRALRLSCAASGFRFDDYAMHWVRQTPGKGLEWVAGISWNSDTIDYA 
DSVKGRFTISRDNAKNSLYLQMNSLRAEDTALYYCTKRRGVTDIDPFDIWGQGTMVIVSS 



Example 3 

Definition of Structurally Conserved Regions and 
identification of chain association residues using 
known three-dimensional structures of antibodies. 

a. Definition of Light Chain SCRs 

First, the linear amino acid sequences of the light 
chain variable regions of a set of antibodies with known 
three-dimensional structures were compared. Eight sequences 
[Table 2] were compared in this example, but more or fewer 
may be used, by linear display of one sequence above the 
other on the computer screen [Figure 6] (SEQ. I.D. Nos. 
14-21). 

Table 2 

Identifier   PDB File Name   Antibody Name   Source   Resolution 

MCP          PDB1MCP.ENT     MCPC603         Mouse    2.7 Å 
FAB2         PDB4FAB.ENT     4-4-20          Mouse    2.7 Å 
HFL          PDB2HFL.ENT     HYHEL-5         Mouse    2.54 Å 
FDL          PDB1FDL.ENT     D1.3            Mouse    2.5 Å 
FBJ          PDB2FBJ.ENT     J539            Mouse    1.95 Å 
FAB1         PDB6FAB.ENT     36-71           Mouse    1.9 Å 
FAB          PDB3FAB.ENT     NEW             Human    2.0 Å 
FB4          PDB2FB4.ENT     KOL             Human    1.9 Å 
SEQ. I.D. No. 14 

DIVMTQSPSSLSVSAGERVTMSCKSSQSLLNSGNQKNFLAWYQQKPGQPPK 
LLIYGASTRESGVPDRFTGSGSGTDFTLTISSVQAEDLAVYYCQNDHSYPLTFGAGTKL 

SEQ. I.D. No. 15 

DWMTQTPLSLPVSLGDQASISCRSSQSLVHSQGNTYLRWYLQKPGQSPKV 
LIYKVSNRFSGVPDRFSGSGSGTDFTLKISRVEAEDI^VYFCSQSTHVPWTFGGGTKLE 

SEQ. I.D. No. 16 

DIVLTQSPAIMSASPGEKVTMTCSASSSVNYMYWYQQKSGTSPKRWIYDTS 
KLASGVPVRFSGSGSGTSYSLTISSMETEDAAEYYCQQWGRNPTFGGGTKLEIK 

SEQ. I.D. No. 17 

DIQMTQSPASLSASVGETVTITCRASGNIHNYIAWYQQKQGKSPQLLVYYT 
TTIADGVPSRFSGSGSGTQYSLKINSLQPEDFGSYYCQHFWSTPRTFGGGTKLEIK 

SEQ. I.D. No. 18 

EIVLTQSPAITAASLGQKVTITCSASSSVSSLHWYQQKSGTSPKPWIYEIS 
KLASGVPARFSGSGSGTSYSLTINTMEAEDAAIYYCQQWTYPLITFGAGTKLELK 

SEQ. I.D. No. 19 

DIQMTQIPSSLSASLGDRVSISCRASQDINNFLNWYQQKPDGTIKLLIYFT 
SRSQSGVPSRFSGSGSGTDYSLTISNLEQEDIATYFCQQGNALPRTFGGGTKLEIK 

SEQ. I.D. No. 20 

SVLTQPPSVSGAPGQRVTISCTGSSSNIGAGNHVKWYQQLPGTAPKLLIFH 
NNARFSVSKSGSSATLAITGLQAEDEADYYCQSYDRSLRVFGGGTKLTVL 

SEQ. I.D. No. 21 

QSVLTQPPSASGTPGQRVTISCSGTSSNIGSSTVNWYQQLPGMAPKLLIYR 
DAMRPSGVPDRFSGSKSGASASLiAIGGLQSEDETDYYCAAWDVSLNAYVFGTGTKVTVL 

Using the Insight II Homology software to 
facilitate the three-dimensional alignment of these 
structures, a landmark amino acid, known to be universally 
conserved among antibodies, such as the cysteine at L23 
(Kabat, E. A., et al., Sequences of Proteins of Immunological 
Interest, Vol. 1 edition, U.S. Department of Health and Human 
Services, PHS, NIH, Bethesda, Maryland (1991)) was identified 
in each sequence. The sequences were vertically aligned on 
the computer screen. 

Now, taking the first two of the linearly aligned 
sequences, one was designated to be held constant and the 
other to be superimposed onto the first (in practice, the 
bottom sequence on the display was held constant due to the 
program design). A one residue box was drawn around the 
aligned cysteines. Then, using the commands for manual 
alignment of structures, the program determined the minimum 
RMS (Root Mean Square) deviation, after applying the optimum 
rotation and translation, of any boxed region. The minimum 
number of residues required in a box by this program before 
RMS deviation can be calculated in this way is three. As an 
integral part of this process a visual representation of the 
superimposed structures is displayed on the screen. A three 
residue box was made, using the program, centered on the 
residue of interest (here, the cysteine L23). The meaning of 
the box within this program is to mathematically superimpose 
the structures using the backbone atoms of the amino acids 
within the box. The box was moved horizontally one residue 
in each direction, sequentially. The position giving the 
lowest RMS deviation for the superposition of backbone atoms 
of the three amino acids from the linearly aligned sequences 
was selected. 

The object of this preliminary step was to 
approximately superimpose the two structures, allowing 
structurally conserved regions (SCRs) to be discerned 
visually. Having achieved this objective, the box was now 
deleted. Using the already superimposed structures, SCRs 
[usually found in the regions of the beta sheets, but also in 
the other portions of the framework regions] were discovered 
by visual inspection. Using the Homology program, as 
described previously in this section, a box was made around 
the amino acid sequences that the SCR comprises. Gaps were 
introduced in the structurally non-conserved (NSCR) regions 
to align the SCR sequences. 

For each SCR defined in this way, the structures of 
each of the other six sequences were superimposed, 
sequentially, in each case holding the same sequence (first 
sequence) constant, and the appropriate boxes were 
determined. Then, the second SCR was identified for the 
initial two structures and the process was repeated working 
through each of the SCRs for all of the sequences (for 
example working from amino to carboxy terminus). Once the 
second set of SCRs was superimposed, the program was directed 
to superimpose the two structures based on all of the 
backbone atoms of the residues of both of the sets of SCRs. 
This process was also repeated for each of the subsequent 
sets of SCRs. 

Now that SCR boxes had been determined for each of 
the sequences, consensus boxes were determined for each SCR. 
Consensus boxes represent the maximum number of amino acid 
positions (e.g. L60-L65 in Figure 6) contained in all of 
the SCR boxes at a particular site. In this example seven 
consensus SCR boxes were formed as shown in Figure 6. 
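
Read literally, a consensus box is the set of positions common to every structure's SCR box at that site, that is, the intersection of the individual boxes. The sketch below illustrates that reading only; it assumes boxes are expressed as inclusive ranges of alignment column indices (plain integers, which ignores lettered insertion positions), and both the function name `consensus_box` and the example ranges are hypothetical.

```python
# Illustrative sketch: a consensus box as the intersection of per-structure
# SCR boxes. Positions are alignment column indices (an assumption made here).

def consensus_box(boxes):
    """Return the (start, end) range contained in every box, or None.

    boxes: iterable of (start, end) inclusive ranges, one per structure.
    """
    boxes = list(boxes)
    start = max(s for s, _ in boxes)  # latest start shared by all boxes
    end = min(e for _, e in boxes)    # earliest end shared by all boxes
    return (start, end) if start <= end else None


# Hypothetical SCR boxes for three structures at one site.
print(consensus_box([(58, 66), (60, 65), (59, 67)]))
```

Because each box was already limited by an RMS criterion, every position in the intersection is structurally conserved across all of the structures compared.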

b. Definition of heavy chain SCRs using known three- 
dimensional structures of antibodies. 

First, the linear amino acid sequences of the heavy 
chain variable regions of a set of antibodies with known 
three-dimensional structures (we used eight sequences in this 
example, but more or fewer may be used) [Table 2] were 
compared by linear display of one sequence above the other on 
the computer screen [Figure 7] (SEQ. I.D. Nos. 22-29). 

SEQ. I.D. No. 22 

EVKLVESGGGLVQPGGSLRLSCATSGFTFSDFYMEWVRQPPGKRLEWIAAS 
RNKGNKYTTEYSASVKGRFIVSRiyrSQSILYLQMNALRAEDTAIYYCARNYYGSTWYFDW 
GAGTTVTVSS 

SEQ. I.D. No. 23 

EVKLDETGGGLVQPGRFMKLSCVA^ 
YSDSVKGRTCISRDDSKSSVYLQMNl^RVEDM^^ 

SEQ. I.D. No. 24 

VQLQQSGAEI^PGASVKISCKASGYTFSDYWIEWVKQRPGHGLE^GEILPGSGSTNYHE 
RPKGKATFTADTSSSTAYMQLNSLTSEDSGVYYCLHGNYDFDGWGQGTTLTVSS 

SEQ. I.D. No. 25 

QVQLKESGPGLVAPSQSLSITCTVSGFSLTC^ 
ALKSRLSISKDNSKSQWLKMNSLHTDDTARYYCARERDYIUjDYWGQGTTLW 

SEQ. I.D. No. 26 

EVKLLESGGGLVQPGGSLKLSCAASGFDFSKYWMSWVRQAP 
PSLKDKFIISRDNAKNSLYLQMSQVRSEDTALYYCARLH^ 

SEQ. I.D. No. 27 

EVQLQQSGVELVRAGSSVKMSCKASGYTFTSNGINWVKQRPG 
EKFKGKTTLTVDKSSSTAYMQLRSLTSEDSAVYFCARSEYYGGSYKFDYWGQGTTLTVSS 

SEQ. I.D. No. 28 

VKLEQSGPGLVRPSQTLSLTCWSGTSFDDYYSTWVRQPPGRGLEWIGYVFYHGTSOT 
LRSRVTMLVOTSKNQFSLRLSSVTAADTAVYYCARl^IAGCI 

SEQ. I.D. No. 29 

EVQLVQSGGGWQPGRSLRLSCSSSGFIFSSYAMYWVRQAPGKGLEWVAIIWDDGSDQHYA 
DSVKGRFTISRlTOSIQmJ^MDSLRPEDTGVYF 
TVSS 






Using the Insight II Homology software to 
facilitate the three-dimensional alignment of these 
structures, a landmark amino acid, known to be universally 
conserved among antibodies, such as the cysteine at H22 
(Kabat, E. A., et al., supra) was identified in each sequence. 
The sequences were vertically aligned on the computer screen. 
Now, taking the first two of the linearly aligned 
sequences, one was designated to be held constant and the 
other to be superimposed onto the first (in practice, the 
bottom sequence on the display was held constant due to the 
program design). A one residue box was drawn around the 
aligned cysteines. Then, using the commands for manual 
alignment of structures, the program determined the minimum 
RMS deviation, after applying the optimum rotation and 
translation, of any boxed region. The minimum number of 
residues required in a box by this program before RMS 
deviation can be calculated in this way is three. As an 
integral part of this process, a visual representation of the 
superimposed structures is displayed on the screen. A three 
residue box was made, using the program, centered on the 
residue of interest (here, the cysteine H22). The meaning of 
the box within this program is to mathematically superimpose 
the structures using the backbone atoms of the amino acids 
within the box. The box was moved horizontally one residue 
in each direction, sequentially, and the position giving the 
lowest RMS deviation for the superposition of backbone atoms 
of the three amino acids from the linearly aligned sequences 
was selected. 

The object of this preliminary step was to 
approximately superimpose the two structures, allowing SCRs 
to be discerned visually. Having achieved this objective, 
the box was now deleted. Using the already superimposed 
structures, SCRs [usually found in the regions of the beta 
sheets, but also in other portions of the framework regions] 
were discovered by visual inspection and put within boxes 
including appropriate amino acids, guided by the RMS 
deviations. Once vertically aligned, the box was expanded in 
both directions to include more amino acids, until the RMS 
deviation became unacceptable (usually >0.75 Å). Then the 
size of the box was reduced to the size which had the last 
acceptable RMS deviation. Gaps were introduced in the 
structurally non-conserved (non-homologous) regions to help 
align the SCRs vertically. 
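
The box-growing rule just described (expand until the RMS deviation exceeds the cutoff, then fall back to the last acceptable size) can be sketched as follows. This is an illustration under simplifying assumptions, not the Insight II procedure: the two structures are taken as already superimposed, each residue is represented by a single (x, y, z) point rather than by all backbone atoms, and the names `rms_deviation` and `grow_box` are hypothetical.

```python
# Illustrative sketch: grow an SCR box around a seed position while the RMS
# deviation between two pre-superimposed structures stays within a cutoff.
import math


def rms_deviation(coords_a, coords_b):
    """RMS distance between paired (x, y, z) points; no refitting is done."""
    squared = [
        (ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2
        for (ax, ay, az), (bx, by, bz) in zip(coords_a, coords_b)
    ]
    return math.sqrt(sum(squared) / len(squared))


def grow_box(coords_a, coords_b, start, end, cutoff=0.75):
    """Extend the inclusive index range [start, end] one residue at a time.

    An extension is kept only if the box still satisfies the cutoff, so the
    returned box is the last acceptable size in each direction.
    """
    n = min(len(coords_a), len(coords_b))

    def acceptable(s, e):
        return rms_deviation(coords_a[s:e + 1], coords_b[s:e + 1]) <= cutoff

    changed = True
    while changed:
        changed = False
        if start > 0 and acceptable(start - 1, end):
            start -= 1
            changed = True
        if end < n - 1 and acceptable(start, end + 1):
            end += 1
            changed = True
    return start, end


# Toy data: ten aligned positions; positions 2-6 coincide, the rest differ by 5 Å.
a = [(float(i), 0.0, 0.0) for i in range(10)]
b = [(float(i), 0.0 if 2 <= i <= 6 else 5.0, 0.0) for i in range(10)]
print(grow_box(a, b, 4, 4))
```

The 0.75 Å default mirrors the "usually" quoted cutoff in the text; a different threshold can be passed where the acceptable deviation differs.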

For each SCR defined in this way, the structures of 
each of the other six sequences were superimposed, 
sequentially, in each case holding the same sequence (first 
sequence) constant, and the appropriate boxes were 
determined. Then, the next SCR was identified for the 
initial two structures and the process was repeated working 
through each of the SCRs for all of the sequences (for 
example working from amino to carboxy terminus). Once the 
second set of SCRs was superimposed, the program was directed 
to superimpose the two structures based on all of the 
backbone atoms of the residues of both of the sets of SCRs. 
This process was also repeated for each of the subsequent 
sets of SCRs. 

Now that SCR boxes had been determined for each of 
the sequences, consensus boxes were determined for each SCR. 
Consensus boxes represent the maximum number of amino acid 
positions (e.g. H3-H6 of antibody FB4 in Figure 7) contained 
in all of the SCR boxes at a particular site. Thus, the 
amino acids contained in each consensus SCR box are 
structurally conserved among all of the database antibodies 
under consideration. In this example ten consensus SCR boxes 
were formed as shown in Figure 7. 

c. Identification of Chain Association residues in 
known structures. 

For each of the known structures used in defining 
the SCRs, described above, chain-association residues were 
identified. First, all residues from the light chain which 
contain any atom which is within about 4.5 Å of any atom of 
any heavy chain residue, except those of the Kabat-defined 
CDRs, were identified. This set was then limited to those 
which by orientation have a significant likelihood of 
interaction with any atom of that heavy chain residue. All 
residues from the heavy chain which contain any atom which is 
within about 4.5 Å of any atom of any light chain residue, 
except those of the Kabat-defined CDRs, were also identified, 
again limited to those which have a significant likelihood of 
interaction. This process was carried out for each of the 
antibodies of known structure shown in Table 2. 
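
The distance criterion used here can be sketched in a few lines. The sketch is illustrative only: it covers the 4.5 Å proximity test and the exclusion of CDR positions on the chain being queried (an assumption about how the exclusion is applied), and it does not attempt the subsequent orientation-based filtering. The function name `contact_residues`, the residue labels, and the coordinates are hypothetical.

```python
# Illustrative sketch: residues of one chain having any atom within a cutoff
# distance of any atom of the other chain, skipping excluded (e.g. CDR) residues.
import math


def contact_residues(chain_a, chain_b, cutoff=4.5, excluded=frozenset()):
    """chain_a, chain_b: dict mapping residue label -> list of (x, y, z) atoms.

    Returns the sorted labels of chain_a residues that contact chain_b.
    """
    other_atoms = [atom for atoms in chain_b.values() for atom in atoms]
    hits = []
    for label, atoms in chain_a.items():
        if label in excluded:
            continue  # e.g. Kabat-defined CDR positions
        if any(math.dist(a, b) <= cutoff for a in atoms for b in other_atoms):
            hits.append(label)
    return sorted(hits)


# Toy coordinates (hypothetical labels and positions, in Å).
light = {"L36": [(0.0, 0.0, 0.0)], "L50": [(0.0, 0.0, 1.0)], "L87": [(20.0, 0.0, 0.0)]}
heavy = {"H45": [(3.0, 0.0, 0.0)]}
print(contact_residues(light, heavy, excluded={"L50"}))
```

Swapping the two chain arguments gives the heavy chain residues that contact the light chain, matching the second pass described in the text.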

Example 4 

Three-dimensional Modeling of ZCE 

a. Three-dimensional modeling of ZCE Light chain 
variable domain. 

The three-dimensional coordinates had not been 
determined for ZCE Fv. For this reason, homology modeling 
was used to approximate the actual structure. The following 
four steps were used: (1) alignment of the ZCE light chain 
variable region sequence with the aligned sequences of the 
set of light chain variable regions of known structure 
described in Example 3.a.; (2) homology modeling of SCRs 
using SCRs from the known light chain variable region 
structures; (3) homology modeling of NSCRs using the full 
range of known structures available in the Brookhaven 
database; and (4) a series of energy minimizations carried 
out to obtain an energetically favorable structure. 



(1) Alignment of ZCE 025 light chain amino acid sequence with 
amino acid sequences of known light chain structures. The 
linear sequence of the ZCE 025 light chain variable region, 
determined from a cDNA clone as described in Example 1.a., 
above, was displayed and aligned with the database sequences 
described in step 1 above, using the Insight II software. As 
described for the database sequences, the first step was to 
align the ZCE 025 sequence with the database sequences using 
the first consensus SCR box. This was accomplished by first 
identifying which residues (one or more) within the box were 
most highly conserved between the known structures, 
identifying these residues in the ZCE sequences, and aligning 
them. In some cases only a subset of the residues identified 
as conserved appeared in the ZCE sequence. In these 
instances, the subset was aligned. Working from one 
consensus SCR box to the next (in this example, we worked 
from amino to carboxy terminus) the process was repeated. 
Where necessary, gaps were introduced either into regions 
other than those corresponding to SCRs (i.e. NSCRs) from ZCE 
or into identical positions within the SCRs of each of the 
aligned known structures. 

The object of this preliminary step was to align 
the ZCE sequence with the sequences of the other light chain 
variable regions of known structure. In each case great 
effort was made to identify the potential locations of ZCE 
SCRs by linear sequence homology to the consensus regions 
alone. The result of this alignment is shown in Figure 8. 

(2) Three-dimensional modeling of SCRs. For each SCR, the 
actual known structure whose sequence has the greatest 
homology to the corresponding ZCE light chain SCR was 
selected as template for that segment, and its coordinates 
were assigned to the ZCE SCR. If there was a residue in a 
template SCR that did not match the corresponding residue in 
the ZCE SCR, the residue in the template was mutated 
to match the ZCE SCR residue, while maintaining the 
coordinates of all the atoms in the backbone and side chains 
of the template residue that correspond to those in the ZCE 
residue and modeling the remaining atoms under the 
constraints of maintaining the same bond lengths, angles and 
dihedrals as those in the original database residue, e.g., 
for gamma and delta carbons. This was done for each SCR (we 
worked from amino to carboxy terminus). After all of the 
SCRs were assigned coordinates in this manner a partial 
three-dimensional structure comprising the modeled SCRs was 
displayed, absent the NSCRs. 

(3) Three-dimensional modeling of NSCRs. For each ZCE 
light chain NSCR, the flanking SCRs of assigned coordinates 
were used along with the length of the NSCR to identify a 
known structure with the greatest likelihood of being 
homologous to the SCR/NSCR/SCR array. This was accomplished 
by using the "Loop Search" subprogram in Insight II to search 
the database for structures (1) containing proper lengths of 
flanking and spanning sequences and (2) having coordinates 
for the flanking sequences with the least RMS deviation from 
those of the assigned SCRs. In practice, approximately ten 
structures (more or fewer can be used) were ranked by the 
program on the basis of RMS deviation of the flanking 
sequences. These were sequentially displayed on the screen 
superimposed on the flanking SCRs. The structure best 
approximating the flanking sequences and having the same 
general orientation as NSCRs from light chain variable 
regions of known structure was chosen as template for that 
particular NSCR, and its coordinates were assigned to the ZCE 
NSCR. This process was then repeated for each NSCR, and the 
NSCRs were added to the computer model by inserting each in 
its appropriate place, for instance flanked by the adjoining 
SCRs, until the entire variable region had been modeled. 
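
The two search criteria for NSCR templates (matching length, then least RMS deviation of the flanking residues) amount to a filter followed by a ranking. The sketch below shows only that logic and does not reproduce the Insight II "Loop Search" subprogram; the record fields `name`, `loop_length`, and `flank_rms`, the function `rank_loop_candidates`, and the example values are all hypothetical.

```python
# Illustrative sketch: keep database fragments whose spanning segment has the
# required length, then rank them by RMS deviation of the flanking residues.

def rank_loop_candidates(candidates, loop_length, max_hits=10):
    """Return up to max_hits candidates of the right length, best flank fit first."""
    matching = [c for c in candidates if c["loop_length"] == loop_length]
    return sorted(matching, key=lambda c: c["flank_rms"])[:max_hits]


# Hypothetical candidates; flank_rms is in Å.
candidates = [
    {"name": "fragment_a", "loop_length": 7, "flank_rms": 0.62},
    {"name": "fragment_b", "loop_length": 7, "flank_rms": 0.41},
    {"name": "fragment_c", "loop_length": 9, "flank_rms": 0.20},
]
ranked = rank_loop_candidates(candidates, loop_length=7)
print([c["name"] for c in ranked])
```

The final choice among the ranked candidates was made by visual inspection of loop orientation, which a ranking of this kind cannot replace.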

(4) Energy Minimizations of modeled structure. Energy 
minimizations were carried out in stages to assure that no 
major structural disruptions would occur. Once all of the 
NSCRs making up the model had in turn been selected from the 
database, fixed in space, and modeled to transform them into 
the corresponding ZCE NSCRs, the splice regions where the 
SCRs join the NSCRs were refined to relieve any strain in the 
model that would result from joining the SCRs and NSCRs, 
using the "Repair" algorithm to assign the proper bond 
lengths, bond angles, and omega values to the structures. 

Now, the "Relax" algorithm was applied in a series 
of sequential steps to the model as a whole: (1) to the side 
chains of the NSCRs to assign proper geometries, and remove 
any unfavorable non-bonded contacts between side chain atoms 
and other atoms in the molecule, (2) to all atoms of the 
NSCRs, and (3) to the mutated side chains of the SCRs. In 
each of these steps, all regions other than those that are 
being relaxed remain fixed to their assigned coordinates. 

Finally, an energy minimization analysis was 
performed using the "Discover" subprogram to allow the model 
to assume an energetically favorable structure. First the 
entire model was subjected to energy minimization with 
backbone atoms tethered to their starting coordinates with a 
defined force constant (usually 100 kcal/Å²). Then energy 
minimization was performed on the entire molecule without the 
backbone atoms being tethered. 
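
Tethering means that a penalty is added to the energy whenever a backbone atom moves away from its starting position. The text gives only the force constant, not the functional form used by the "Discover" subprogram, so the sketch below assumes a simple harmonic penalty, k times the squared displacement, summed over the tethered atoms. The function name `tether_energy` and the coordinates are hypothetical.

```python
# Illustrative sketch: harmonic tether penalty on atoms restrained to their
# starting coordinates. The form E = k * d^2 is an assumption made here.

def tether_energy(coords, reference, force_constant=100.0):
    """Sum of force_constant * (squared displacement) over paired atoms.

    coords, reference: lists of (x, y, z) in Å; result is in kcal if the
    force constant is in kcal/Å^2.
    """
    return sum(
        force_constant * ((x - x0) ** 2 + (y - y0) ** 2 + (z - z0) ** 2)
        for (x, y, z), (x0, y0, z0) in zip(coords, reference)
    )


start = [(0.0, 0.0, 0.0), (1.1, 0.0, 0.0)]
moved = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
# Stage 1: tethered backbone; stage 2: tether removed (force constant of zero).
print(tether_energy(moved, start), tether_energy(moved, start, force_constant=0.0))
```

With a force constant of this size, a displacement of only 0.1 Å already costs about 1 kcal per atom, which is why the first stage removes local strain without letting the backbone drift.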

The result of carrying out these steps was a model 
of the ZCE light chain. 

b. Three-dimensional modeling of ZCE 025 Heavy chain 
variable domain. 

Like the light chain, coordinates had not been 
determined for ZCE heavy chain. For this reason, homology 
modeling was again used to approximate the actual structure. 
The same steps were used for heavy chain as for light. 

(1) Alignment of ZCE 025 heavy chain amino acid sequence 
with amino acid sequences of known heavy chain structure. 

The linear sequence of the ZCE heavy chain variable region, 
determined from a cDNA clone as described in Example 1.b., 
above, was displayed and aligned with the database sequences 
described in Example 3.b. above, using the Insight II 
software. As described for the light chain, the first step 
was to align the ZCE sequence with the database sequences 
using the first consensus SCR box. The remainder of the 
process was precisely as described for the light chain, with 
the final alignment displayed in Figure 9. 

(2) Three-dimensional modeling of SCRs. For each SCR, the 
actual known structure whose sequence has the greatest 
homology to the corresponding ZCE SCR was selected as 
template for that particular SCR (these are shown in bold in 
Figure 9), and its coordinates were assigned to the ZCE SCR. 
This was done for each SCR (we worked from amino to carboxy 
terminus). After all of the SCRs were assigned coordinates 
in this manner a partial three-dimensional structure was 
displayed, absent the NSCRs. 

(3) Three-dimensional modeling of NSCRs. As for the light 
chain, for each ZCE heavy chain NSCR, the flanking SCRs which 
had been assigned coordinates were used along with the length 
of the NSCR to identify a known structure with the greatest 
likelihood of being homologous to the SCR/NSCR/SCR array, 
using the "Loop Search" subprogram in Insight II. This 
process was then repeated for each NSCR, until the entire 
variable region had been modeled. 

(4) Energy Minimizations of modeled structure. Energy 
minimizations were carried out in stages, as for the light 
chain, to assure that no major structural disruptions would 
occur. The process was identical to that described for the 
light chain, including (1) use of the "Repair" algorithm to 
assign the proper bond lengths, bond angles, and omega values 
to the structures; (2) use of the "Relax" algorithm to assign 
proper geometries and remove any unfavorable non-bonded 
contacts; (3) use of the "Discover" subprogram to allow the 
model to assume an energetically favorable structure. Once 
again, the entire molecule was first subjected to energy 
minimization with backbone atoms tethered to their starting 
coordinates with a defined force constant (usually 100 
kcal/Å²). Then energy minimization was performed on the 
entire molecule without the backbone atoms being tethered. 

The result of carrying out these steps was a model 
of the ZCE heavy chain. 

c. Three-dimensional modeling of ZCE 025 heavy and 
light chains together to form Fv. 

Now that coordinates had been determined for both 
light and heavy chain, these were displayed on the screen 
together. Since energy minimization had been carried out on 
each chain separately, the first step was to carry out the 
same procedure on the chains as a set. Finally, an energy 
minimization was performed using the "Discover" subprogram to 
allow the model to assume an energetically favorable 
configuration. First, potential chain association residues 
for ZCE light and heavy chains were identified by comparison 
with the chain association residues of the known structures 
(determined in Example 3.c). Chain association residues in 
the aligned sequences were compared with residues in the 
corresponding position in ZCE. When an identical residue was 
present, it was designated as favorable to chain association; 
if a different residue was found, it was designated as 
potentially disrupting to chain association. Totals of 
favorable and disrupting residues were determined for the 
comparison of ZCE light and heavy chains to each of the known 
structures. The known structure providing the comparison 
having the greatest excess of favorable residues over 
disruptive residues was chosen as template for ZCE 
heavy/light association. If two or more known structures had 
the same excess of favorable over disruptive residues, the 
structure having the greatest number of favorable residues 
over disruptive residues was chosen. In this example 2HFL 
was chosen. 
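
The template choice described above is a simple scoring rule, sketched below. The sketch assumes that a tie in the excess of favorable over disruptive residues is broken by the larger count of favorable residues, which is one reading of the text; the function names `score_template` and `choose_template`, the template names, and the residue tables are hypothetical.

```python
# Illustrative sketch: pick the known structure whose chain-association
# residues best match the target sequence at the aligned positions.

def score_template(template_residues, target_residues):
    """Return (favorable, disruptive) counts for one template.

    Both arguments map an aligned position label to a one-letter residue.
    A position is favorable when the target has the identical residue.
    """
    favorable = sum(
        1 for pos, aa in template_residues.items() if target_residues.get(pos) == aa
    )
    return favorable, len(template_residues) - favorable


def choose_template(templates, target_residues):
    """Choose by greatest excess of favorable over disruptive residues;
    ties are broken by the greater number of favorable residues (assumed)."""
    def key(name):
        favorable, disruptive = score_template(templates[name], target_residues)
        return (favorable - disruptive, favorable)
    return max(templates, key=key)


# Hypothetical residue tables.
target = {"L36": "Y", "L38": "Q", "L44": "P", "L87": "Y", "L98": "F"}
templates = {
    "T1": {"L36": "Y", "L38": "Q", "L44": "V"},                        # 2 vs 1
    "T2": {"L36": "Y", "L38": "Q", "L44": "P", "L87": "F", "L98": "L"},  # 3 vs 2
    "T3": {"L36": "F", "L38": "Q"},                                    # 1 vs 1
}
print(choose_template(templates, target))
```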

Next, the light chain structure determined for ZCE 
in Example 4.a. was superimposed on that of the light chain 
structure of 2HFL, using the backbone coordinates of the 
favorable residues described above. This was carried out 
using the "superimpose" command in the Insight II software. 
The same was done for the ZCE heavy chain using the 2HFL 
heavy chain. Next the entire molecule was subjected to 
energy minimization with backbone atoms tethered to their 
starting coordinates with a defined force constant (usually 
100 kcal/Å²). Then energy minimization was performed on the 
entire light/heavy association model without the backbone 
atoms being tethered. 

d. Identification of CDR-Associated Residues in ZCE 
025 Fv. 

For ZCE, the aim of modeling is to identify regions 
that must be conserved to conserve the function of the CDRs. 
To do this it is necessary to (1) identify all potential CDR- 
associated residues and (2) identify the subset of these 
which have a reasonable likelihood of a significant 
interaction with the CDR residue involved. 

To determine which amino acids lying outside of the 
Kabat-defined CDRs may influence binding of the CDRs, all the 
amino acid residues outside of the Kabat-defined CDR regions 
that have an atom located within about 4.5 Angstroms of any 
atom in an amino acid located in the Kabat-defined CDR 
regions of the ZCE construct were identified as CDR- 
associated residues. As these were predicted to be important 
for maintaining the binding specificity of the ZCE antibody, 
they were earmarked for preservation as donor amino acids in 
the CDR grafted antibody construct in addition to those in 
the defined CDR regions. 

We first identified all residues on the light or 
heavy chain that have atoms that are within 4.5 Å of any 
atoms of any light chain CDR residue. The set was limited to 
those with a significant likelihood of interaction, based on 
orientation of the residue, charge, hydrophobicity, etc. 
Next, all residues on the light or heavy chain that contain 
atoms which are within 4.5 Å of any atom of any heavy chain 
CDR residue were identified. Again, the set was limited to 
those with a high likelihood of significant interaction with 
the CDR residue of interest. In this way, the entire set of 
light and heavy chain CDR-associated residues was determined. 

Example 5 

Three-dimensional modeling of IM9 Fv 

a. Three-dimensional modeling of IM9 Light chain 
variable domain. 

Homology modeling was used to approximate the 
structure of the IM9 antibody. This process had four steps: 
(1) alignment of the IM9 light chain variable region sequence 
with the aligned sequences of the set of light chain variable 
regions of known structure (see Example 3.a. above); (2) 
homology modeling of the IM9 light chain SCRs using SCRs from 
the known light chain variable region structures; (3) 
homology modeling of NSCRs (non-structurally conserved 
regions) using the full range of known structures available 
in the Brookhaven database (other known structures could also 
be used); and (4) a series of energy minimization routines to 
determine the energetically preferred structure. 

(1) Alignment of IM9 light chain amino acid sequence 
with amino acid sequences of known light chains. 

The linear DNA sequence [SEQ. I.D. No. 10] of the 
IM9 light chain variable region was determined from a cDNA 
clone as described in Example 2.i. 

The linear amino acid sequence [SEQ. I.D. No. 11] 
of the IM9 light chain variable domain was displayed on the 
computer screen and aligned with the sequences of the eight 
light chain variable regions of known structure described in 
Example 3.a. above, using the Insight II software. The IM9 
sequence was aligned with the database sequences using the 
first consensus SCR box. The residues (one or more) within 
the box which were most highly conserved between the known 
structures were identified, after which the corresponding 
residues in the IM9 sequences were identified and the 
structures were aligned. When a subset of the residues 
identified as conserved in the known structures appeared in 
the IM9 sequence, the subset was aligned. The alignment 
proceeded from one consensus SCR box to the next as described 
above for the known sequences. The alignment proceeded from 
amino to carboxy terminus, but would work as well if 
reversed. Gaps were introduced either into regions other 
than those corresponding to SCRs (i.e. NSCRs) from IM9 or 
into identical positions within the SCRs of each of the 
aligned known structures when necessary for alignment. 

This preliminary step allowed alignment of the IM9 
sequence with the sequences of the other light chain variable 
regions of known structure. The potential locations of IM9 
SCRs were identified by linear sequence homology to the 
consensus regions. This alignment is shown in Figure 10. 

(2) Three-dimensional modeling of r M 9 light chain 

SCRS . 

For each SCR, the actual known light chain
structure whose sequence had the greatest homology to the
corresponding IM9 light chain SCR was selected as the
template for that segment (these are shown in bold in
Figure 10) and its coordinates were assigned to the IM9 SCR.
In instances where a residue in a template SCR did not match
the corresponding residue in the IM9 SCR, the coordinates of
all the atoms in the backbone and sidechains of the template
residue that correspond to those in the IM9 residue were
maintained. The remaining atoms (e.g., for gamma and delta
carbons and the atoms bonded to them) were modeled under the
constraints of maintaining the same bond lengths, angles and
dihedrals as those in the original database residue. This
was done for each SCR (we worked from amino to carboxy
terminus). After all of the SCRs were assigned coordinates
in this manner, a partial three-dimensional structure
comprising the modeled SCRs was displayed, absent the NSCRs.
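
By way of illustration only, the per-SCR template selection described above can be sketched as follows. The sketch assumes that "greatest homology" is approximated by the fraction of identical residues over a gap-free aligned segment; the function names, the structure names and the short sequences are hypothetical and are not taken from Figure 10.

```python
def percent_identity(a: str, b: str) -> float:
    """Fraction of matching positions between two aligned, gap-free SCR segments."""
    if len(a) != len(b) or not a:
        raise ValueError("SCR segments must be non-empty and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def pick_template(query_scr: str, known_scrs: dict) -> str:
    """Name of the known structure whose SCR segment is most similar to the query."""
    return max(known_scrs, key=lambda name: percent_identity(query_scr, known_scrs[name]))


# Hypothetical SCR segments from three known structures (illustrative only).
known = {
    "structure_A": "DIQMTQSP",
    "structure_B": "DIVMTQSP",
    "structure_C": "EIVLTQSP",
}
query = "DIQMTQFP"  # hypothetical query SCR segment
best = pick_template(query, known)
```

Repeating the selection for each SCR in turn, from amino to carboxy terminus, gives one template per segment, which is why different SCRs of the same chain may borrow coordinates from different known structures.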

(3) Three-dimensional modeling of IM9 light chain
NSCRs.

For each IM9 light chain NSCR, the flanking SCRs
which had been assigned coordinates were used along with the
length of the NSCR to identify a known structure with the
greatest likelihood of being structurally homologous to the
SCR components of the SCR/NSCR/SCR array. In addition, the
known structure containing a region corresponding to the NSCR
component of the aforementioned SCR/NSCR/SCR array was
identified which had an orientation most like that of the
corresponding region of the antibodies of known structure.
This was accomplished by using the "Loop Search" subprogram
in Insight II to search the database for structures (1)
containing proper lengths of flanking and spanning sequences
and (2) having backbone coordinates for the flanking
sequences with the least RMS deviation from those of the
assigned SCRs. In practice, a maximum of ten structures
(more or fewer can be used depending on the limitations of
the program used) were ranked by the program on the basis of
RMS deviation of the coordinates of the backbone atoms of the
flanking sequences. These were sequentially displayed on the
screen superimposed on the flanking SCRs. The structure best
approximating that of the flanking sequences, having the same
general orientation as NSCRs from light chain variable
regions of known structure, and having a minimum of
structurally significant mutations was chosen as template for
that particular NSCR and its coordinates were assigned to the
NSCR. This process was then repeated for each NSCR, until
the entire variable region had been modeled.
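
The ranking criterion used in the loop search can be illustrated with a minimal sketch. It is not the "Loop Search" subprogram itself; it assumes that candidate flanking-sequence backbone atoms have already been placed in the same coordinate frame as the assigned SCRs (no optimal superposition is performed), and all names and coordinates are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equal-length lists of (x, y, z) atoms."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(
        (ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2
        for (ax, ay, az), (bx, by, bz) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


def rank_candidates(target, candidates, limit=10):
    """Return up to `limit` (name, rmsd) pairs, lowest RMS deviation first."""
    scored = [(name, rmsd(target, coords)) for name, coords in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1])[:limit]


# Hypothetical backbone atoms of the flanking SCR residues (angstroms).
target = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
candidates = {
    "loop_1": [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    "loop_2": [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)],
    "loop_3": [(0.0, 0.0, 0.5), (1.0, 0.0, 0.5)],
}
ranking = rank_candidates(target, candidates)
```

The ranked list is only a shortlist; as described above, the final choice among the candidates also weighs the general orientation of the loop and the number of structurally significant mutations, which were judged by inspection.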

(4) Energy Minimizations of modeled IM9 light chain
structure.

Energy minimizations were carried out in stages to
assure that no major structural disruptions would occur.
First the splice regions where the SCRs join the NSCRs were
refined to relieve any strain in the model that would result
from joining the SCRs and NSCRs, using the "Repair" algorithm
to assign the proper bond lengths, bond angles, and omega
values to the residues in the splice region.

Then, the "Relax" algorithm was sequentially
applied to the regions as follows: (1) to the sidechains of
the NSCRs to assign proper geometries and remove any
unfavorable non-bonded contacts between NSCR sidechain atoms
and other atoms in the molecule; (2) to all atoms of the
NSCRs to remove remaining unfavorable contacts between the
NSCR and other atoms in the molecule; (3) to the altered side
chains of the SCRs to remove any unfavorable non-bonded
contacts between mutated SCR side chain atoms and other atoms
in the molecule; and (4) to all the sidechain atoms of the
SCR to remove remaining unfavorable side chain contacts. In
each of the above described steps, all regions other than
those which are being "relaxed" remain fixed to their
assigned coordinates.

Finally, an energy minimization was performed using
the "Discover" program to allow the model to assume an
energetically favorable structure. First the entire model
was subjected to energy minimization with backbone atoms
tethered to their starting coordinates with a defined force
constant (usually 100 Kcal/Å²). Then energy minimization was
performed on the entire molecule without the backbone atoms
being tethered.
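
One common way to express such a two-stage scheme is as a harmonic restraint added to the force-field energy in the first stage and dropped in the second. The formulation below is offered only as a sketch under that assumption; the exact functional form used by the "Discover" program (for example, whether a factor of one half is applied) is not stated here. The symbols E_ff (force-field energy), r_i (current position of backbone atom i) and r_i^0 (its starting position) are introduced for illustration.

```latex
% Stage 1: backbone atoms tethered to their starting coordinates
E_{\text{stage 1}}(\mathbf{r}) = E_{\text{ff}}(\mathbf{r})
  + k \sum_{i \in \text{backbone}} \left\lVert \mathbf{r}_i - \mathbf{r}_i^{0} \right\rVert^{2},
  \qquad k \approx 100\ \text{Kcal/\AA}^{2}

% Stage 2: tether removed, entire molecule free to move
E_{\text{stage 2}}(\mathbf{r}) = E_{\text{ff}}(\mathbf{r})
```

Under this reading, the first stage lets sidechains and local geometry relax while the restraint penalizes backbone drift, and the second stage then refines the whole molecule from that already-relaxed starting point.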

The result of carrying out these steps was the
homology model of the IM9 light chain.

b. Three-dimensional modeling of IM9 heavy chain
variable domain.

The steps used to model the IM9 heavy chain are 
similar to those used in modeling the IM9 light chain. 

(1) Alignment of IM9 heavy chain amino acid sequence
with amino acid sequences of known heavy chain
structures.

The linear DNA sequence [SEQ. I.D. No. 12] of the
IM9 heavy chain variable region was determined from a cDNA
clone as described in Example 2.j.

The linear amino acid sequence [SEQ. I.D. No. 13]
of the IM9 heavy chain variable domain was displayed and
aligned with the database sequences, described in
Example 3.b. above, using the Insight II software. As
described for the light chain, the first step was to align
the IM9 sequence with the database sequences using the first
consensus SCR box. The remainder of the process was
precisely as described for the light chain, with the final
alignment displayed in Figure 11.



(2) Three-dimensional modeling of IM9 heavy chain
SCRs.



For each SCR, the actual known structure whose
sequence had the greatest homology to the corresponding IM9
SCR was selected as the template (shown in bold in
Figure 11) and its coordinates were assigned to the
corresponding IM9 SCR. The process, working from amino to
carboxy terminus, was repeated for each SCR. After all of
the SCRs were assigned coordinates, a partial
three-dimensional structure was displayed, absent the NSCRs.

(3) Three-dimensional modeling of IM9 heavy chain
NSCRs.

As for the light chain, for each IM9 heavy chain
NSCR, the flanking SCRs which had been assigned coordinates
were used along with the length of the NSCR to identify a
known structure with the greatest likelihood of being
structurally homologous to the SCR components of the
SCR/NSCR/SCR array.

In addition, the known structure containing a
region corresponding to the NSCR component of the
aforementioned SCR/NSCR/SCR array was identified which had an
orientation most like that of the corresponding region of the
antibodies of known structure. This was accomplished by
using the "Loop Search" subprogram in Insight II to search
the database. This process was then repeated for each NSCR,
until the entire variable region had been modeled.

(4) Energy Minimizations of modeled IM9 heavy chain 
structure. 

Energy minimizations were carried out in stages, as
for the light chain, to assure that no major structural
disruptions would occur. The process used was in substantial
accordance with that described for the light chain. The
process comprised the following steps: (1) use of the
"Repair" algorithm to assign the proper bond lengths, bond
angles, and omega values to the splice regions; (2) use of
the "Relax" algorithm to assign proper geometries and remove
any unfavorable non-bonded contacts from the mathematical
model; and (3) use of the "Discover" subprogram to allow the
model to assume an energetically favorable structure. As
described for the light chain, the entire molecule was first
subjected to energy minimization with backbone atoms tethered
to their starting coordinates with a defined force constant
(usually 100 Kcal/Å²). Then energy minimization was
performed on the entire molecule without the backbone atoms
being tethered.

The resultant structure was used as the model of
the IM9 heavy chain.

c. Three-dimensional modeling of IM9 heavy and
light chains together to form Fv.

The coordinates determined for the light and heavy
chains were used to generate a model of the Fv. As an
initial step, potential chain association residues for IM9
light and heavy chains were identified by comparison with the
chain association residues of the known structures
(determined in Example 3.c). Chain association residues in
the aligned sequences were compared with residues in the
corresponding position in IM9. When an identical residue was
present, it was designated as favorable to chain association;
if a different residue was found, it was designated as
potentially disrupting to chain association. Totals of
favorable and disrupting residues were determined for the
comparison of IM9 light and heavy chains to each of the known
structures. The known structure providing the comparison
having the greatest excess of favorable residues over
disruptive residues was chosen as template for IM9
heavy/light association. If two or more known structures had
the same excess of favorable over disruptive residues, the
structure having the greatest number of favorable residues
was chosen as template. In this example, FDL was chosen.
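
The selection rule just described (greatest excess of favorable over disruptive residues, with ties broken by the greatest number of favorable residues) may be sketched as follows. The position numbers, residues and template names are hypothetical and do not reproduce the comparison that led to FDL.

```python
def score(template_assoc: dict, query: dict) -> tuple:
    """Count (favorable, disruptive) chain association positions for one template.

    template_assoc maps a position to the template's chain association residue;
    query maps a position to the residue found at that position in the query.
    """
    favorable = sum(1 for pos, res in template_assoc.items() if query.get(pos) == res)
    disruptive = len(template_assoc) - favorable
    return favorable, disruptive


def choose_template(templates: dict, query: dict) -> str:
    """Pick the template with the greatest excess; break ties on favorable count."""
    def key(name):
        favorable, disruptive = score(templates[name], query)
        return (favorable - disruptive, favorable)
    return max(templates, key=key)


# Hypothetical aligned positions and residues (illustrative only).
query = {36: "Y", 38: "Q", 44: "P", 87: "Y", 98: "F"}
templates = {
    "T1": {36: "Y", 38: "Q", 44: "P", 87: "F"},  # 3 favorable, 1 disruptive
    "T2": {36: "F", 38: "Q"},                    # 1 favorable, 1 disruptive
    "T3": {36: "Y", 38: "Q"},                    # 2 favorable, 0 disruptive
}
chosen = choose_template(templates, query)
```

Here T1 and T3 have the same excess of two, so the tie-break on the number of favorable residues selects T1.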

Next, the light chain structure determined for IM9
in Example 5.a. was superimposed on the template light chain
structure of FDL, using the backbone coordinates of the
favorable residues described above. This was carried out
using the "superimpose" command in the Insight II software.
The same was done for the IM9 heavy chain using the FDL heavy
chain. Next the entire molecule was subjected to an energy
minimization with the backbone atoms tethered to their
starting coordinates with a defined force constant (usually
100 Kcal/Å²). Then an energy minimization was performed on
the entire light/heavy associated (Fv) model without the
backbone atoms being tethered.

d. Identification of Chain-Association Residues in
IM9 Fv.

The regions of IM9 that should be conserved to
allow for optimal associations between the chains in regions
other than those that will be replaced (the CDRs and
CDR-associated regions) were determined by (1) identification
of all chain association residues; (2) identification of all
CDR-associated residues; and (3) delineation of the
non-CDR-associated subset of chain association residues. The
individual steps are described in detail below.

Residues from the light chain that contain an atom
that is within about 4.5 Å of any atom of any heavy chain
residue were identified. This set was then limited to those
residues that have a significant likelihood of interacting
with that heavy chain residue (or any other). All residues
from the heavy chain containing an atom that is within about
4.5 Å of any atom of any light chain residue were identified,
again limited to those that have a significant likelihood of
interaction. Next, all residues on the light or heavy chain
that contain an atom that is within about 4.5 Å of any atom
of any light chain CDR residue were identified. Again the
set was limited to those with a significant likelihood of
interaction. Next, all residues on the light or heavy chain
that contain an atom that is within about 4.5 Å of any atom
of any heavy chain CDR residue, and that have a high
likelihood of significant interaction with the CDR residue of
interest, were identified. Finally, the subset of chain
association residues not contained within either set of
CDR-associated residues was determined and classed as IM9
chain-association residues.
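
The distance screen and the final set subtraction can be sketched as below. This is an illustration only: it applies the 4.5 Å cutoff but omits the subsequent judgment of "significant likelihood of interaction" (orientation, charge, hydrophobicity), and it assumes, consistent with the stated aim of excluding regions that will be replaced, that the CDR residues themselves are also removed from the result. Residue identifiers and coordinates are hypothetical.

```python
import math

CUTOFF = 4.5  # angstroms, per the "within about 4.5 A" criterion


def in_contact(atoms_a, atoms_b, cutoff=CUTOFF):
    """True if any atom of one residue lies within `cutoff` of any atom of the other."""
    return any(math.dist(a, b) <= cutoff for a in atoms_a for b in atoms_b)


def contacts(group_a, group_b, cutoff=CUTOFF):
    """Ids of residues in group_a that contact at least one other residue in group_b."""
    return {
        rid
        for rid, atoms in group_a.items()
        if any(in_contact(atoms, other, cutoff)
               for oid, other in group_b.items() if oid != rid)
    }


# Hypothetical residues, each reduced to a single atom position for brevity.
light = {"L36": [(0.0, 0.0, 0.0)], "L50": [(20.0, 0.0, 0.0)], "L91": [(0.0, 10.0, 0.0)]}
heavy = {"H45": [(3.0, 0.0, 0.0)], "H100": [(0.0, 12.0, 0.0)]}
cdr = {"L91": light["L91"]}  # assume L91 is a CDR residue

chain_assoc = contacts(light, heavy) | contacts(heavy, light)
cdr_assoc = contacts({**light, **heavy}, cdr)
conserved = chain_assoc - cdr_assoc - set(cdr)
```

In this toy case L36/H45 and L91/H100 are in contact across the chains, but H100 is CDR-associated and L91 is itself a CDR residue, so only L36 and H45 remain as chain-association residues to be conserved.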
a. Modeling of CDR-grafted ZCE/IM9 light chain
variable region.

The IM9 and ZCE light chain amino acid sequences
were aligned with reference to the sequences of the eight
known structures. On the ZCE linear array, the Kabat-defined
CDRs and the CDR-associated residues determined in Example 4
were identified. For SCR or NSCR regions which do not
contain a CDR or CDR-associated residue, the entire region
was replaced with the IM9 sequence. For SCRs which contain
one or more CDR or CDR-associated residues, the non-CDR and
non-CDR-associated residues were replaced with IM9 sequence,
but the ZCE sequence was conserved for the CDR or
CDR-associated residues. For NSCRs which contain one or more
CDR or CDR-associated residues, the replacement depended upon
the relative lengths of the region of interest in acceptor
and donor molecules. If the NSCR had the same number of
residues in both the acceptor (IM9) and the donor (ZCE)
molecules, the non-CDR-associated residues were replaced with
acceptor (IM9) sequence. If, however, the NSCR differed in
number of residues between the acceptor and donor, the donor
(ZCE) sequence was conserved for the entire segment. In this
way the primary sequence for the light chain CDR-grafted
molecule was determined. The residues of the CDR-grafted
primary sequence were assigned coordinates to match those of
the residues in the light chain sequences of the superimposed
models of ZCE and IM9 from which they were derived. This was
done working from amino to carboxy terminus.
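
The segment-by-segment replacement rules for the light chain can be summarized in a short sketch. It is a simplified illustration: each segment is treated as an already-aligned pair of donor (ZCE) and acceptor (IM9) strings, the set of donor positions that are CDR or CDR-associated is supplied by the caller, and the function name and the mnemonic sequences ("D" for donor, "A" for acceptor) are hypothetical.

```python
def graft_segment(kind: str, donor: str, acceptor: str, keep_donor: set) -> str:
    """Return the grafted sequence for one SCR or NSCR segment.

    kind       -- "SCR" or "NSCR"
    donor      -- donor segment sequence
    acceptor   -- acceptor segment sequence
    keep_donor -- 0-based indices in the donor segment that are CDR or
                  CDR-associated residues and must be conserved
    """
    if kind not in ("SCR", "NSCR"):
        raise ValueError("kind must be 'SCR' or 'NSCR'")
    if not keep_donor:
        # No CDR or CDR-associated residue: take the whole acceptor segment.
        return acceptor
    if len(donor) != len(acceptor):
        if kind == "SCR":
            raise ValueError("aligned SCR segments are expected to have equal length")
        # NSCR of differing length: conserve the entire donor segment.
        return donor
    # SCR, or NSCR of equal length: keep donor residues only at flagged positions.
    return "".join(d if i in keep_donor else a
                   for i, (d, a) in enumerate(zip(donor, acceptor)))


segments = [
    ("SCR", "DDDD", "AAAA", set()),   # no CDR content
    ("SCR", "DDDD", "AAAA", {1}),     # one CDR-associated residue
    ("NSCR", "DDD", "AAA", {0}),      # equal length, one CDR residue
    ("NSCR", "DDDDD", "AAA", {2}),    # lengths differ
]
grafted = [graft_segment(*seg) for seg in segments]
```

Concatenating the grafted segments from amino to carboxy terminus gives the primary sequence, and each residue then inherits coordinates from whichever parent model supplied it.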

b. Modeling of CDR-grafted ZCE/IM9 heavy chain
variable region.

The IM9 and ZCE heavy chain amino acid sequences
were aligned with reference to the sequences of the eight
known heavy chain structures. On the ZCE linear array, the
Kabat-defined CDRs and the CDR-associated residues determined
in Example 4 were identified. For SCR or NSCR regions which
do not contain a CDR or CDR-associated residue, the entire
region was replaced with the IM9 sequence. For SCRs which
contain one or more CDR or CDR-associated residues, the
non-CDR and non-CDR-associated residues were replaced with
IM9 sequence, but ZCE sequence was conserved for the CDR or
CDR-associated residues. For NSCRs which contain one or more
CDR or CDR-associated residues, the ZCE sequence was
conserved for the entire region. In this way the amino acid
sequence for the heavy chain CDR-grafted molecule was
determined. The coordinates of the residues of the
CDR-grafted primary sequence were obtained from those of the
residues in the heavy chain sequences of the superimposed
models of ZCE and IM9 from which they were derived. This was
done working from amino to carboxy terminus.

c. Modeling of Humanized ZCE Fv.

Now that coordinates had been assigned for both
light and heavy chain, these were displayed on the screen
together. An energy minimization was performed using the
"Discover" subprogram to allow the model to assume an
energetically favorable structure. First the entire model
was subjected to energy minimization with backbone atoms
tethered to their starting coordinates with a defined force
constant (usually 100 Kcal/Å²). Then the energy minimization
was performed on the entire model without the backbone atoms
being tethered.

d. Modification of the humanized ZCE 025 model so
that only CDR-associated residues found in the
murine ZCE 025 model meet the definition of
CDR-associated residues.

CDR-associated residues were determined for the
modeled humanized ZCE Fv in substantial accordance with the
methodology taught for the original ZCE Fv. First, all
residues on the light or heavy chain which contain atoms
which are within about 4.5 Å of any atoms of any light chain
CDR residue, and which also have a significant likelihood of
interaction, based on orientation, charge, hydrophobicity,
etc., were identified. Next, all residues on the light or
heavy chain which contain atoms within about 4.5 Å of any
atoms of any heavy chain CDR residue were identified. The
set was then limited to those with a high likelihood of
significant interaction with any atoms of the CDR residue of
interest. The entire set of light and heavy chain
CDR-associated residues was thus determined.

The set of CDR-associated residues determined for
the humanized Fv was compared to that determined for the ZCE
Fv. In any case where an additional CDR-associated residue
was present in the humanized Fv, the amino acid at that
position was replaced by the amino acid found in the murine
ZCE. In the case where a CDR-associated residue in ZCE was
not identified as CDR-associated in the humanized ZCE and is
found in a NSCR, the entire NSCR was changed to the donor
(ZCE) sequence.
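
The comparison of the two sets of CDR-associated residues reduces to two set differences, sketched below. The position numbers, the murine residues and the NSCR labels are hypothetical, and the helper names are introduced only for illustration.

```python
def back_mutations(humanized_assoc: set, murine_assoc: set, murine_seq: dict) -> dict:
    """Positions newly CDR-associated in the humanized model, mapped to the murine residue."""
    extra = humanized_assoc - murine_assoc
    return {pos: murine_seq[pos] for pos in sorted(extra)}


def nscrs_to_revert(humanized_assoc: set, murine_assoc: set, nscr_of: dict) -> set:
    """NSCRs holding a murine CDR-associated residue that was lost in the humanized model."""
    lost = murine_assoc - humanized_assoc
    return {nscr_of[pos] for pos in lost if pos in nscr_of}


# Hypothetical data (illustrative only).
murine_assoc = {2, 49, 71}
humanized_assoc = {2, 49, 66}
murine_seq = {2: "I", 49: "Y", 66: "R", 71: "F"}
nscr_of = {71: "NSCR3"}  # position 71 lies in a hypothetical NSCR

mutations = back_mutations(humanized_assoc, murine_assoc, murine_seq)
reverted = nscrs_to_revert(humanized_assoc, murine_assoc, nscr_of)
```

In this toy case position 66 is replaced by the murine residue, and the NSCR containing position 71 is changed as a whole to the donor sequence.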

e. Confirmation of chain-association residues.

After the CDR-associated residues were modified if
necessary as described above, the model was analyzed to
determine if the chain association residues identified for
IM9 were conserved. In this example, they were conserved.
If, however, differences are observed, they are noted, but no
changes are made at this time. If, in addition, a
significant decrease in secreted protein is observed for the
humanized molecule, these are potential sites for
modification.

The amino acid sequences for light and heavy chain
hZCE, determined above, are shown in Figure 7 and Figure 8,
respectively.

Modeling of hZCE-CSVL and hZCE-kb Fv

a. Modeling of hZCE-CSVL.

The IM9 light and ZCE heavy chain primary amino
acid sequences had already been aligned with reference to
different sequences. Therefore, it was necessary to bridge
these alignments through realignment using a common sequence.
The IM9 heavy chain sequence was used for this purpose as
shown in Figure 12. In addition, the IM9 heavy chain
provided information on chain association residues. ZCE
heavy chain sequence was added and aligned with the linear
array containing light chain ZCE and light and heavy chain
IM9 sequences. Once aligned in this manner, SCRs were
defined therebetween as described in Example 3. The
Kabat-defined CDRs and CDR-associated residues, determined in
Example 4, were identified on the ZCE heavy chain linear
array. For SCR or NSCR regions which do not contain a CDR or
CDR-associated residue, the entire region was replaced with
the IM9 light chain sequence (and structure, i.e.,
coordinates). For SCRs which contain one or more CDR or
CDR-associated regions, the non-CDR-associated residues were
replaced with IM9 sequence (and structure, i.e.,
coordinates), but ZCE heavy chain sequence (and structure,
i.e., coordinates) was conserved for the CDR-associated
residues. For NSCRs that contain one or more CDR or
CDR-associated residues, the ZCE heavy chain sequence (and
structure, i.e., coordinates) was conserved for the entire
region. In this way the primary sequence for the heavy chain
CDR-grafted molecule was determined, and a composite
structure was developed.

Now, the resultant model was modified to assure
that chain association residues derived from the IM9 model
were conserved. In all non-CDR or non-CDR-associated
regions, when the amino acid in the position occupied by the
chain association residue was different from the
corresponding IM9 heavy chain residue, it was replaced with
the corresponding IM9 heavy chain chain-association residue.
In this example no chain-association residues were found to
lie in the CDR or CDR-associated regions. In the unlikely
event that this should occur, the residue should be noted,
but no change should be made. In addition, one residue,
leucine at position 94 of the mature CSVL, was changed to a
methionine. The final amino acid sequence of the mature CSVL
is shown in Figure 13 [SEQ. I.D. No. 30].

SEQ. I.D. No. 30

DIQMTQFPST LSASVGDRVN ITCRASGFTF SNFGMHWIRQ KPGKGLKWVA 
YISGGSSTVH YADSLKGRFT ISRDNPKNEL FLTITSLQPD DFAMYYCARD 
VYVNNYWYFD VWGQGTKVEI KR (122 residues) 
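
As a simple consistency check on the listing above, the stated length and the substitution at position 94 can be confirmed programmatically. The sketch below only re-reads the sequence as printed; the variable names are illustrative.

```python
# Mature CSVL sequence as listed, with the grouping spaces removed.
csvl = (
    "DIQMTQFPST LSASVGDRVN ITCRASGFTF SNFGMHWIRQ KPGKGLKWVA "
    "YISGGSSTVH YADSLKGRFT ISRDNPKNEL FLTITSLQPD DFAMYYCARD "
    "VYVNNYWYFD VWGQGTKVEI KR"
).replace(" ", "")

length = len(csvl)           # number of residues in the listing
residue_94 = csvl[94 - 1]    # positions are 1-based in the text
```

The listing contains 122 residues, and position 94 carries methionine (M), in agreement with the leucine-to-methionine change described above.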

Alternatively, hZCE light chain can be used as
acceptor and hZCE heavy chain can be used as donor. In this
case, chain association residues used for the preliminary Fv
model are those identified for hZCE Fv.

b. Model hZCE-kb Fv.

Now that coordinates had been assigned for both
light and heavy/light hybrid chains, these were displayed on
the screen together. An energy minimization was performed
using the "Discover" subprogram to allow the model to assume
an energetically favorable configuration. First the entire
model was subjected to energy minimization with backbone
atoms tethered to their starting coordinates with a force
constant of 100 Kcal/Å². Then the energy minimization
algorithm was applied to the entire model without the
backbone atoms being tethered.

c. Modify to assure no added CDR-associated
residues.

CDR-associated residues were determined for the
modeled humanized ZCE light chain dimer as for the original
ZCE Fv of Example 4.d. Again, this was done by first
identifying all residues on the light or heavy/light hybrid
chain that are within 4.5 Å of any light chain CDR residue,
and that also have a significant likelihood of interaction,
based on orientation of the residue, charge, hydrophobicity,
etc. Next, all residues on the light or heavy/light hybrid
chain that were within 4.5 Å of any heavy/light hybrid chain
CDR residue were identified. Again, the set was limited to
those with a high likelihood of significant interaction with
the CDR residue of interest. In this way, the entire set of
light and heavy/light hybrid chain CDR-associated residues
was determined.

The set of CDR-associated residues determined for
the humanized light chain dimer was compared to that
determined for the ZCE Fv. In any case where an additional
CDR-associated residue was seen for the humanized molecule,
the amino acid at that position was replaced by the amino
acid found in the murine ZCE. Care should be taken in this
step, as these replacements depend upon whether that residue
lies in an SCR or NSCR segment, as explained in Example 6
above.

d. Confirm Chain-association residues. 

After the CDR-associated residues were modified as
necessary, the model was analyzed to determine if the chain
association residues identified for IM9 were conserved. In
this example, they were conserved. If, however, differences
are observed, these are noted, but no changes are made at
this time; if a significant decrease in expression is
observed for the humanized molecule, these are potential
sites for modification.

Construction of expression vectors pGIM9kappa and
pGIM9k/hZCE-kappa

a. Construction and Screening of the IM9 Genomic
Library in E. coli Bacteriophage Lambda for the
Ig Kappa Gene.

IM9 genomic DNA was extracted and purified using
methods described in Sambrook (supra, pp. 9.4-9.30). The DNA
was partially digested with ifcal and separated by sucrose
density gradient ultra-centrifugation. The gradients were
fractionated and the aliquots were analyzed for size by
agarose gel electrophoresis, as described in Sambrook (supra,
pp. 6.3-6.19). The fractions between 8-20 Kb were pooled,
and dialyzed against TE Buffer (10 mM Tris HCl; 1 mM EDTA, pH
7.4). "Tris" is [Tris(hydroxymethyl)aminomethane].

The IM9 DNA was ligated to Lambda EMBL3 arms
(commercially available from Stratagene, San Diego,
California) and packaged with the lambda bacteriophage
packaging kit, Gigapack® Gold (Stratagene). The recombinant
bacteriophage particles were used to transfect E. coli strain
P2/392, which was inoculated onto 1% NZY agar medium in 140
mm diameter plates. The lambda library contained 6.55 X 10*
individual clones, and was amplified by plating at 3.3 X 10«
plaques per plate on twenty plates and suspending the
bacteriophage in 200 ml total of SM buffer (5.8 g NaCl, 2 g
MgSO4·6H2O, 50 ml 1 M Tris-HCl, pH 7.5, and 5 ml 2% gelatin
per liter).

The library was plated as described in Sambrook
(supra, pp. 2.61-2.63), on twenty 140 mm agarose plates at
2.5 x 10^4 plaques per plate. The lambda phage plaques were
blotted onto nitrocellulose and treated with denaturing and
neutralizing solutions followed by baking at 80°C in a vacuum
oven. Filters were then pre-hybridized in 50% formamide, 5 x
SSC (75 mM Na citrate; 750 mM NaCl), 0.1% SDS, 5 x Denhardt's
solution (0.1% bovine serum albumin (BSA), 0.1% ficoll, 0.1%
polyvinylpyrrolidone), 200 µg/ml yeast tRNA, 100 µg/ml salmon
sperm DNA at 42°C for 2 hours. Fragments of human
immunoglobulin kappa chain DNA were labeled with a Prime-It®
kit (commercially available from Stratagene) in substantial
accordance with the directions provided by the manufacturer,
and hybridized with the blots overnight in hybridization
solution (50% formamide, 5 x SSC, 0.1% SDS, 1 x Denhardt's
solution (0.02% BSA, 0.02% ficoll, 0.002%
polyvinylpyrrolidone), 100 µg/ml salmon sperm DNA) at 42°C.
The blots were washed twice at 42°C in 2 x SSC and 0.1% SDS
for 20 minutes, then at 65°C in 0.2 x SSC, 0.1% SDS for 20
minutes and exposed to XAR-5 X-ray film (commercially
available from Eastman Kodak Corp.) overnight at -70°C
between two intensifying screens.

The positive plaques were picked and subjected to
two rounds of phage DNA purification as described in Sambrook
(supra, pp. 2.73-2.76). The purified phage DNA was analyzed
by restriction enzyme mapping and Southern blot, as described
in Sambrook (supra, pp. 9.31-9.57). Figure 14 provides a
restriction map of the IM9 kappa gene in bacteriophage lambda
EMBL3.

b. Subcloning the Intact Kappa Gene into
pBluescript®

Southern blot analysis was used to map the intact
kappa chain gene to an 8.8 Kb BamHI fragment. This fragment
was isolated from the lambda phage DNA by digestion with
BamHI followed by agarose gel electrophoresis. The 8.8 Kb
BamHI fragment was ligated using T4 DNA ligase (commercially
available from Life Technologies, Inc.) following the
manufacturer's instructions, with pBluescript®SK-
(commercially available from Stratagene, San Diego, CA) which
had been previously digested with BamHI. Restriction
endonuclease mapping revealed the 5' end of the gene was
adjacent to the SacI end of the polylinker.

In order to facilitate modification of the gene,
the 5' end of the gene was then sub-cloned as a BamHI to
BstEII fragment containing the IgK promoter, the variable
exons and a portion of the major intron. The BstEII
restriction endonuclease leaves a 5' overhang that is not
compatible with any of the sites in the pBluescript®SK-
polylinker, so it was necessary to modify the overhanging
sequence to make it blunt ended. This was carried out by
digesting the pBluescript®SK- clone described above with
BstEII and filling in the 5' overhang with Klenow fragment
and a solution of all four deoxyribonucleotides, using the
method described in Sambrook (supra, pp. 5.40-5.43). This
was followed by BamHI digestion and isolation of the 2.2 Kb
fragment by agarose gel electrophoresis. This fragment was
ligated with pBluescript®SK-, previously digested with EcoRV
(which leaves a blunt end) and BamHI. The resulting plasmid
is shown in Figure 15. The DNA sequence of the clone was
determined as described in Example 1, above.

c. Engineering the 5' End of the Gene to Create
Unique SfiI Sites Flanking the Variable Exon and
Removing an MstII Site to Make the Sites Flanking
the Constant Exon Unique.

Two oligonucleotide primers were synthesized on a
Millipore DNA synthesizer (Bedford, MA), following the
manufacturer's instructions, for mutagenesis of the 5' end of
the IM9 kappa gene: primer B239 (SEQ. I.D. No. 31)
TAGTGGATCCAACTGATTTCTCCAT upstream for the BamHI site at the
5' end of the kappa gene and primer B240 (SEQ. I.D. No. 32)
TTATTTACTTCTGGGTCACCAGGTTTATTC downstream for the BstEII site
in the major intron. The downstream primer recreates the
BstEII site that had been altered in the previous step for
insertion into pBluescript®SK-.

Two SfiI sites were designed to flank the variable
region exon, each having a unique sticky end so as not to
re-ligate to each other in cloning but to allow for forced
orientation cloning of synthetic variable region cassettes
for CDR grafted antibodies. Each SfiI site, the upstream
SfiI site and the downstream SfiI site, involved the design
of a pair of oligonucleotide primers and a round of overlap
mutagenesis (see Figure 16) as described in D.H. Jones and
B.H. Howard, "A rapid method for recombination and
site-specific mutagenesis by placing homologous ends on DNA
using polymerase chain reaction", BioTechniques 10:62-66
(1991).

Two PCR reactions were performed, each using the
variable exon clone as the template. The first used the 5'
flanking primer B239 as the 5' primer and the upstream SfiI
primer B435 (SEQ. I.D. No. 33)
AAGAGGCCGAGCTGGCCCTTCCCTGAATAACCAGGCAGT as the 3' primer.
The second used the 3' flanking primer B240 as the 3' primer
and the upstream SfiI primer B434 (SEQ. I.D. No. 34)
GGGAAGGGCCAGCTCGGCGTGTTCCTATAATATGATCAA as the 5' primer.
The products of these reactions were purified and used
together as templates in an overlap PCR reaction with primers
B239 and B240 as shown in Figure 16. The product of the
overlap reaction was the full BamHI to BstEII fragment and
contained an SfiI site in the appropriate upstream location.
This product was used as the template in a new pair of
reactions to install the downstream SfiI site in a similar
manner, using primers B379 (SEQ. I.D. No. 35)
TTCCTGGCCCTGCAGGCCCAGTTGTCTGTGTCTTCTGTT and B380 (SEQ. I.D.
No. 36) AACTGGGCCTGCAGGGCCAGGAAGCAAAGTTTAAATTCTA. The PCR
was performed according to the instructions in the GeneAmp®
PCR kit (commercially available from Perkin Elmer-Cetus,
Norwalk, CT) on a Thermal Cycler® (commercially available
from Perkin Elmer-Cetus). The reaction was performed for 30
cycles of one minute at 94°C, one minute at 55°C, and two
minutes at 72°C in a buffer that contained a 1.5 mM final
concentration of MgCl2.

The product of the PCR reaction was cloned into
pCR™II vector using a TA Cloning™ Kit (both commercially
available from Invitrogen) in substantial accordance with the
manufacturer's protocol. The identity of the clone was
verified by restriction mapping to be the IM9 kappa BamHI to
BstEII fragment with two engineered SfiI sites of the
appropriate size and location.

The MstII site upstream of the kappa promoter,
shown in Figure 16, was destroyed by linearizing the clone
described above with MstII and filling in the 5' overhang to
make a blunt end. Re-ligation of the modified ends yielded a
sequence that no longer contained an MstII site.

The clone described above was characterized by DNA
sequencing analysis as described in Example 1.

The engineered BamHI to BstEII fragment was
isolated from pCR™II by PCR using two primers, B495 and B496
(SEQ. I.D. No. 37 CATGTCTGGATCCAACTGATTT and SEQ. I.D.
No. 38 CTGATTTACTTCTGGGTGACCAGGTTTATTCAA, respectively).

d. Ligation of the Kappa Gene Fragments with the
pSV2gpt (Enhancer minus).

The mutated BamHI to BstEII fragment from the SfiI
mutagenesis, described in Example 8.c, still contained the
native IM9 kappa variable region sequence. It was then
ligated with the BstEII to ClaI fragment taken from the
pBluescript®SK- clone and the pSV2gpt (enhancer minus) ClaI
to BamHI fragment (Beidler, et al., supra).

The resulting clone was analyzed by restriction
enzyme mapping, Southern blot analysis, and DNA sequence
analysis. The confirmed sequence is provided as a
restriction map in Figure 17.

e. Insertion of an hZCE Kappa Variable Exon into the
pGIM9kappa Vector Using the Engineered SfiI
Sites.

The hZCE kappa variable region was taken from a 
PCR1000™ clone using PCR mutagenesis according to the 
manufacturer's instructions to add the Sfii sites at the 5- 
35 and 3- ends. The oligonucleotide B510 (SBQ. i.d. NO. 39) 
5 " -AAGGGCCAGCTCGGCCT- 

CTTCCTATAATATGATCAATAGTATAAATATTTGTGTT1OTATTTCCAATCTCAGGTGCCA 
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AATGTGACATCCAGATGACCCA-3 ' was used as the 5' end primer and 
B511 (SBQ. I.D. NO. 40) 5'- 

TGGGCCTGCAGGGCCAGGAAQCAAAGTTTAAATTCTAC- 
TCACGTTTGATTTCCACCTTGGTT-3 ' as the 3' end primer. The 
resulting PCR fragment was digested with sfil. The plasmid 
pGlM9kappa. deposited with the ATCC with accession number 
75512, was also digested with Sfil resulting in three 
fragments. The hZCE kappa variable region containing 
fragment described above was ligated with the largest two of 
the three fragments resulting from the pGDWkappa digestion. 
This was carried out as a three fragment ligation reaction. 
The three Sfil sites have different overhanging sequences due 
to the nature of the Sfil recognition sequence and so 
oriented cloning of the three fragments into pGIM9kappa was 
achieved. The resulting clone pGlM9k/hZCE-kappa was verified 
by DMA sequence analysis as having the correct variable exon 
sequence. 
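
The oriented ligation described above relies on the degenerate
centre of the SfiI site. As an illustrative sketch only (not part
of the original procedure), the following Python locates SfiI
sites and reports the 3-base 3' overhang each would leave,
assuming the commonly documented cut pattern GGCCNNNN^NGGCC. The
function name and the use of only the 5' ends of primers B510 and
B511 are choices made for this illustration; overhangs are
reported on the strand as written.

```python
import re

# SfiI recognizes GGCCNNNN^NGGCC; the central N bases are arbitrary,
# so each site can carry its own 3-nt 3' overhang (assumed cut pattern).
# A lookahead is used so that overlapping candidate sites are not skipped.
SFI_SITE = re.compile(r"(?=(GGCC([ACGT]{5})GGCC))")


def sfii_overhangs(seq):
    """Return (start_index, overhang) for each SfiI site on the given strand."""
    seq = seq.upper()
    # Bases 6-8 of the 13-bp site (the middle three N's) form the overhang.
    return [(m.start(), m.group(2)[1:4]) for m in SFI_SITE.finditer(seq)]


# 5' ends of primers B510 and B511 as printed in the text above.
b510_5prime = "AAGGGCCAGCTCGGCCT"
b511_5prime = "TGGGCCTGCAGGGCCAGG"

print(sfii_overhangs(b510_5prime))
print(sfii_overhangs(b511_5prime))
```

Because the two engineered sites carry different central bases,
their overhangs differ, which is consistent with the statement
that the fragments can only ligate in one orientation.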

Example 9

Construction and Subcloning of hZCE-CSVL

a. Construction of hZCE-CSVL gene.

The amino acid sequence derived above for the hZCE
CDR-grafted, CDR-switched variable light region was converted
into DNA sequence using software from DNASTAR (Madison, WI).
Six oligonucleotides with overlapping ends and spanning the
sequence of the hZCE-CSVL gene were synthesized on a
Millipore DNA synthesizer (Bedford, MA). The sequences of
the six oligonucleotides comprising the template are provided
as SEQ. I.D. Nos. 41-46:

B695 = 5'-GGG-AAG-GGC-CAG-CTC-GGC-CTC-TTC-CTA-TAA-TAT-GAT-
CAA-TAG-TAT-AAA-TAT-TTG-TGT-TTC-TAT-TTC-CAA-TCT-CAG-GTG-CCA-
AAT-GTG-ACA-TCC-AGA-TGA-CCC-AGT-TTC-CT-3' (SEQ. I.D. No. 41)

B696 = 5'-GCA-TGC-CGA-AGT-TGG-AGA-AGG-TGA-AGC-CGG-AGG-CGC-
GGC-AGG-TGA-TGT-TCA-CGC-GGT-CGC-CCA-CGG-AGG-CGG-ACA-GGG-TGG-
AAG-GAA-ACT-GGG-TCA-TCT-GGA-TGT-3' (SEQ. I.D. No. 42)

B549 = 5'-GGC-TTC-ACC-TTC-TCC-AAC-TTC-GGC-ATG-CAC-TGG-ATC-
CGC-CAG-AAG-CCC-GGC-AAG-GGC-CTG-AAG-TGG-GTG-GCC-TAC-ATC-TCC-
GGC-GGC-TCC-TCC-ACC-GTG-CAC-TA-3' (SEQ. I.D. No. 43)

B550 = 5'-GGT-GAT-GGT-CAG-GAA-CAG-CTC-GTT-CTT-GGG-GTT-GTC-
GCG-GGA-GAT-GGT-GAA-GCG-GCC-CTT-CAG-GGA-GTC-GGC-GTA-GTG-CAC-
GGT-GGA-GGA-GCC-GCC-GGA-GAT-GTA-3' (SEQ. I.D. No. 44)

B697 = 5'-CCC-CAA-GAA-CGA-GCT-GTT-CCT-GAC-CAT-CAC-CTC-CCT-
GCA-GCC-CGA-CGA-CTT-CGC-CAT-GTA-CTA-CTG-CGC-CCG-CGA-CTA-CTA-
CGT-GAA-CAA-CTA-CTG-GTA-CTT-CGA-CGT-GT-3' (SEQ. I.D. No. 45)

B698 = 5'-CAC-AGA-CAA-CTG-GGC-CTG-CAG-GGC-CAG-GAA-GCA-AAG-
TTT-AAA-TTC-TAC-TCA-CGT-TTTG-ATC-TCC-ACC-TTG-GTG-CCC-TGG-CCC-
CAC-ACG-TCG-AAG-TAC-CAG-TAG-TT-3' (SEQ. I.D. No. 46)

The six oligonucleotides were used in a PCR reaction using
Taq polymerase and two additional oligonucleotide primers,
B553 (SEQ. I.D. No. 47) 5'-GGG-AAG-GGC-CAG-CTC-GGC-CTC-TT-3'
and B554 (SEQ. I.D. No. 48) 5'-CAC-AGA-CAA-CTG-GGC-CTG-
CA-3', for amplification. The oligonucleotide templates,
primers, PCR reagents and buffers were used at concentrations
described by the manufacturer. Twenty-five cycles of
amplification were carried out, each as follows: denature at
94°C for one minute, anneal at 55°C for one minute, and
extend at 72°C for one minute.
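
Assembly from overlapping oligonucleotides depends on the 3' end
of each forward oligonucleotide annealing to the 3' end of the
next reverse oligonucleotide. The following Python is a minimal
sketch, offered only as an illustration, of how such an overlap
can be checked. It uses the 3' ends of B549 and B550 as printed
above; the helper names are hypothetical and an exact-match
criterion is assumed.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def annealing_overlap(forward_oligo, reverse_oligo):
    """Longest 3' suffix of forward_oligo that base-pairs exactly with
    the 3' end of reverse_oligo (0 if the ends do not pair)."""
    target = revcomp(reverse_oligo)  # reverse oligo read on the forward strand
    for n in range(min(len(forward_oligo), len(target)), 0, -1):
        if forward_oligo[-n:] == target[:n]:
            return n
    return 0


# 3' ends of B549 (forward) and B550 (reverse) as printed in the text.
b549_3prime = "TAC-ATC-TCC-GGC-GGC-TCC-TCC-ACC-GTG-CAC-TA".replace("-", "")
b550_3prime = "GTA-GTG-CAC-GGT-GGA-GGA-GCC-GCC-GGA-GAT-GTA".replace("-", "")

overlap = annealing_overlap(b549_3prime, b550_3prime)
print(overlap)
```

An overlap of a few tens of bases at each junction lets the
polymerase extend every annealed pair into a contiguous template
that the outer primers B553 and B554 can then amplify.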

b. Subcloning of hZCE-CSVL gene into TA Vector.

Following PCR synthesis of the CDR-grafted variable
region containing ZCE-025 heavy chain CDRs, the approximately
500 base pair DNA fragment was ligated into a TA holding
vector as per the manufacturer's protocol (Invitrogen, San
Diego). TA vectors are provided by the manufacturer as
linear molecules containing a single deoxythymidylate as an
overhang on each of the vector's 3' ends. This is
complementary to the deoxyadenylate overhangs found on the 3'
ends of PCR products due to the terminal transferase activity
of Taq polymerase.

TA clones containing inserts of the correct size
(about 500 base pairs) were identified by EcoRI restriction
digests of DNA minipreps using methods known in the art. Up
to ten clones with appropriate insert sizes were sequenced on
a DNA sequencer (DuPont, Delaware). A clone
with the appropriate sequence was digested to completion with
SfiI restriction endonuclease. This site was
present at the 5' and 3' ends of the hZCE-CSVL gene for
cloning into the final expression vector as described in
Example 10, below. The hZCE-CSVL fragment was isolated
by agarose gel electrophoresis using the gel purification method
[illegible in source]. Following precipitation, the fragment
was resuspended in sterile distilled H2O and the
concentration was determined by running a small aliquot on a
gel, as described previously.






Example 10

a. Construction of pGIM9k/hZCE(CSVL)-kappa.

The SfiI to SfiI fragment containing the
hZCE-CSVL region was combined with a 9 kb SfiI to SfiI
fragment isolated from the pGIM9 kappa expression vector by
standard ligation (Sambrook, et al.). As shown in
Figure 18, the resulting expression vector,
pGIM9k/hZCE(CSVL)-kappa, contained the following components:

(1) Human IM-9 kappa promoter, signal exon I
and signal intron (up to added SfiI site).

(2) The hZCE(CSVL) gene beginning with an SfiI
site in the signal intron and including the
pGIM9 kappa signal exon II, hZCE(CSVL) region
and extending to an SfiI site at beginning of
the major intron.

(3) Human IM-9 kappa major intron (from SfiI
site), kappa constant exon and 3' flanking
sequences (containing native polyadenylation
site).

(4) XGPRT gene under the control of an
enhancerless SV40 early promoter.

(5) Bacterial plasmid origin of replication,
derived from pBR322.

(6) Bacterial β-lactamase, driven off its
native promoter.


b. Transfection of SP 2/0 and hZCEk.

Vector pGIM9k/hZCE(CSVL)-kappa, on deposit
with ATCC under the provisions of the Budapest Treaty, Deposit
no. 75530, was electroporated into two different host cell
lines, SP 2/0 and hZCEk. hZCEk is a transfectoma derived
from SP 2/0 by transfection with the vector pGIM9k/hZCE-kappa,
which expresses CDR grafted ZCE/IM-9 light chain
(hZCEk-homodimer) [Example 8.e.]. For SP 2/0,
pGIM9k/hZCE(CSVL)-kappa was electroporated together with the
drug selectable gene neo in the vector pSV2neo, and
transfectants were selected by growth in HH4 medium
containing 1.5 mg/ml geneticin (Bethesda Research Labs/Gibco,
Gaithersburg, MD). For hZCEk, pGIM9k/hZCE(CSVL)-kappa was
also co-electroporated with pSV2neo to allow selection of
transfectants in medium with geneticin at 1.5 mg/ml.
Electroporation conditions and selection media recipes were
as described by Chu, et al. (Nucleic Acids Research 15:1311-
1325 (1987)). Briefly, the SP2/0 cells were grown in media
containing 10% FBS and were maintained in log phase growth
for the three days preceding electroporation. Fifty
micrograms of the plasmid vector was linearized using the
restriction enzyme PvuI (1 unit/µg) and the Reaction Buffer
#7 from GIBCO-BRL (Gaithersburg, MD). At the time of
transfection the SP2/0 cells were collected by centrifugation
in an IEC clinical centrifuge (800 rpm, 10 min, room
temperature). Cells were washed in Hanks Buffered Saline
Solution from Gibco Laboratories (Grand Island, NY)
containing an additional 6 mM dextrose and resuspended at a
final concentration of 1.0 x 10^7 cells/ml. 0.5 ml of cells
were aliquoted into cuvettes and the linearized DNA was
added. Electroporation was done using the Cell-Porator®
(GIBCO-BRL) with settings of 300 µF and 350 volts.

c. Selection and characterization of hZCE-kb
expressing clones.

Resistant clones of each host cell line were
identified by growth on appropriate selective media and
assayed for hZCE(CSVL) chain production (SP 2/0 host) and CEA
binding (hZCEk host) activity as described in Example 15
below. The resultant clones were called hZCEhb (SP 2/0
host) and hZCEkb (hZCEk host). hZCEhb produces only the
human kappa light chain with ZCE heavy chain CDRs, secreted as
a homodimer, while hZCEkb produces a human light chain dimer
with one kappa chain containing ZCE heavy chain CDRs and the
other containing ZCE light chain CDRs. A conventional human
kappa ELISA can be used to quantitate production levels of
the homodimer from hZCEhb, but a CEA-binding ELISA is
required to quantitate the antigen binding heterodimer
hZCEkb. The hZCEkb chain or hZCEhb chain were secreted as
dimers. The hZCEhb homodimer did not bind CEA, while the
hZCEkb had affinity for CEA.

Example 11

Construction of hZCE(CSVL) Expression Vector and
Expression of hZCE(CSVL)

a. Preparation of gene for chelating peptide.

A CDR switched variable region isolate was
constructed as a variation of the hZCE(CSVL) kappa chain
where the human kappa constant region would be deleted so as
to express the hZCE(CSVL) light chain domain only. To screen
for the CDR switched isolate construct, it was desirable to
express it as a fusion protein containing a metal chelating
peptide for purification. The gene encoding the chelating
peptide was prepared by creating a DNA fragment which would
ultimately replace the human kappa constant exon in the
pGIM9k/hZCE(CSVL)-kappa vector. Using PCR techniques, an
approximately 330 base pair MstII/MstII modified fragment
(Fragment A) was prepared using the pGIM9k/hZCE(CSVL)-kappa
expression vector as template. The upstream primer in this
PCR reaction was B1000 (SEQ. I.D. No. 49) 5'-CAC CAT CCT
GTT TGC TTC TTT CCT CAG GAA CTG TGC ACT GGC ACC ACC ACC CAT
AGA GGG AGA AGT GCC CCC ACC TGC TCC TCA GTT-3', which
included the codons for a 6-amino acid chelating peptide, and
the downstream primer was B441 (SEQ. I.D. No. 50) 5'-
GGGTAAAAATAGAATGAAGGATGATTTTTATAAAT-3'. Fragment A
consisted, 5' to 3', of (1) an MstII restriction site and the
splice acceptor site from the IM9 kappa constant region; (2)
the codons for the first three amino acids of the kappa
constant region; (3) the codons for a six amino acid
chelating peptide sequence (HWHHHP) and a termination codon;
and (4) 3' untranslated sequence including the
polyadenylation site and native MstII restriction site.

b. Construction of pGIM9k/hZCE(CSVL) expression
vector.

Fragment A and pGIM9k/hZCE(CSVL)-kappa were
digested with either MstII or Bsu36I (Stratagene, 10X
Universal buffer, 37°C for a minimum of 3 hours) to produce
ligatable ends. Fragments (~330 bp of Fragment A and ~12.8
kb of pGIM9k/hZCE(CSVL)-kappa) were thus isolated and purified
using Milligen's Ultrafree-MC (Yonezawa, Japan) method.
Ligation was carried out using components and ligation
conditions from a TA Cloning Kit (Invitrogen, San Diego, CA)
following the manufacturer's protocol. Electroporation into
Electromax DH10B cells (BRL, Gaithersburg, MD) was performed.
Transformed cells were plated onto agar and incubated overnight,
and colonies were grown up for plasmid minipreps using
Qiagen's (Chatsworth, CA) "Mini Plasmid" protocol. Construct
size was verified by restriction digest analysis using EcoRI,
MstII or Bsu36I, SstI, and BamHI enzymes.

Large scale plasmid preparations were performed
using Qiagen's "Maxi plasmid" prep procedure. DNA sequencing
was performed to verify the correct sequence, which is called
hZCE(CSVL) (SEQ. I.D. No. 51). The cloned plasmid herein is
called pGIM9k/hZCE(CSVL).

SEQ. I.D. No. 51

GAC ATC CAG ATG ACC CAG TTT CCT TCC ACC CTG TCC GCC TCC GTG 
GGC GAC CGC GTG AAC ATC ACC TGC CGC GCC TCC GGC TCC ACC TTC 
TCC AAC TTC GGC ATG CAC TGG ATC CGC CAG AAG CCC GGC AAG GGC 
CTG AAG TGG GTG GCC TAC ATC TCC GGC GGC TCC TCC ACC GTG CAC 
TAC GCC AAC TCC CTG AAG GGC CGC TTC ACC ATC TCC CGC GAC AAC 
CCC AAG AAC GAG CTG TTC CTG ACC ATC ACC TCC CTG CAG CCC GAC 
GAC TTC GCC ATG TAC TAC TGC GCC CGC GAC TAC TAC GTG AAC AAC 
TAC TGG TAC TTC GAC GTG TGG GGC CAA GGG ACC AAG GTG GAA ATC 
AAA 
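
As an illustration of how a sequencing result such as SEQ. I.D.
No. 51 can be checked against the intended protein, the following
Python sketch translates codons with the standard genetic code.
It is not part of the original procedure. The inputs are the
first printed line of SEQ. I.D. No. 51 and the stretch of primer
B1000 that, read in frame, spells the chelating peptide; the
choice of that reading frame is an assumption made here for
illustration.

```python
from itertools import product

# Standard genetic code in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string (spaces allowed) codon by codon; '*' marks stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First printed line of SEQ. I.D. No. 51.
seq51_line1 = "GAC ATC CAG ATG ACC CAG TTT CCT TCC ACC CTG TCC GCC TCC GTG"
# Bases from primer B1000, regrouped into the frame that encodes the peptide.
chelating_codons = "CAC TGG CAC CAC CAC CCA TAG"

print(translate(seq51_line1))
print(translate(chelating_codons))
```

Read this way, the B1000 stretch gives the HWHHHP chelating
peptide followed by a stop codon, in agreement with the
description of Fragment A.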

c. Expression of hZCE(CSVL).

Linearization of pGIM9k/hZCE(CSVL) DNA was
performed via ClaI digestion. Electroporation into SP2/0
cells was performed as previously described in Example 10.b.
Cells were seeded in HH4 medium supplemented with 10% FCS.
Three days later, cells were plated at 2 x 10^5/ml in 24-well
cluster plates in the presence of HH4, 10% FCS, MAX (MAX
= 1.0 µg/ml mycophenolic acid plus 100 µg/ml xanthine). At
day 14 after plating, colonies were harvested and transferred
to 6-well plates (Falcon) for expansion and serum-free medium
adaptation. Clones were successfully expanded and adapted to
serum-free conditions within 2 weeks.



Example 12

Construction of pNIM9k/hZCE-gamma.

a. Construction of hZCE heavy chain variable exon.

The protein sequence of the heavy chain of hZCE was
converted to nucleic acid sequence in the following manner:
(1) if the amino acid was derived from ZCE, the actual ZCE
codon at the site was used; (2) if the amino acid was derived
from IM9, the actual IM9 codon at the site was used; (3) if
the amino acid was derived from a consensus sequence, any
appropriate codon was used.

The hZCE gamma variable exon (SEQ. I.D. No. 58)
shown below was obtained by PCR reactions.

SEQ. I.D. No. 58

GAA ATG CAA CTG GTG GAA TCT GGG GGA GGC CTG CTA CAG CCT GGC 
CGG GCC CTG CGG CTC TCC TGT GCA GCC TCT GGA TTC ACT TTT AGT 
AAC TTT GGA ATG CAC TGG ATT CGG CAA ACT CCA GGG AAG GGC CTG 
GAG TGG GTC GCA TAC ATT AGT GGT GGC AGT AGT ACC GTC CAC TAT 
GCA GAC TCC TTG AAG GGC CGA TTC ACC ATC TCC CGG GAC AAC GCC 
AAG AAC TCC CTC TAT TTG CAA ATG ACC AGT CTC CGG GCT GAG GAC 
ACG GCC TTG TAT TAC TGT GCA CGG GAT TAC TAC GTT AAT AAC TAC 
TGG TAC TTC GAT GTC TGG GGC CAA GGG ACA ATG GTC ATC GTC TCT 
TCA G 

Five overlapping oligonucleotides, B156, [four designations
illegible] (SEQ. I.D. Nos. 60-64),

SEQ. I.D. No. 60

5'-GAT CCG AAA TGC AAC TGG TGG AAT CTG GGG GAG GCC TGC TAC
[illegible] CAG CCT CTG GAT TCA
CCT TTA G-3'

SEQ. I.D. No. 61

[illegible] CAC [illegible] CGA CCC ACT CCA GGC CCT T[illegible]C CTG GAG
TTT GCC GAA TCC AGT GCA TTC CAA AGT TAC TAA AGG TGA ATC CAG

SEQ. I.D. No. 62

[illegible] TCC ACT ATG
CAG ACT CCT TGA AGG GCC GAT TCA CCA TCT CCC GGG ACA ACG CCA
AGA A-3'

SEQ. I.D. No. 63

5'-TAT TAC TGT GCA CGG GAT TAC TAC GTT AAT AAC TAC TGG TAC
TTC GAT GTC TGG GGC CCA GGG ACA ATG GTC ATC GTC TCT TCA-3'

SEQ. I.D. No. 64

were synthesized on a DNA synthesizer (Millipore) following
the manufacturer's instructions and were fused together by a
PCR reaction using B611 (SEQ. I.D. No. 65) 5'-[illegible] GAT
[illegible] AAA TGC AAC TGG TGG AAT CT-3' and B612 (SEQ. I.D.
No. 66) 5'-GAC GAA TTC TGA AGA GAC GAT GAC CAT TG-3' as the
end primers. The resulting fused fragment was cloned into pCR™II
(Invitrogen), and the sequence was verified as described in
Step 2.1.j.

b. Construction of the hZCE gamma expression vector,
pNIM9k/hZCE-gamma (cDNA).

The hZCE heavy variable exon and the entire IM9
gamma constant region (from the 5' IM9 heavy CH1 exon to the
BstEII site 3' of the CH3 exon) were fused together by an
overlap PCR reaction. Two PCR reactions were performed. The
first PCR reaction used the pCR™II clone from a. above as template
and primers B611 and B612. The PCR product was reamplified
with primers B467 and B567. The second PCR reaction used
primers B566 and B514. The products of these two reactions
were used together as templates in the overlap reaction with
primers B467 and B514. The resulting fusion fragment of hZCE
heavy variable exon and 5' IM9 heavy CH1 exon to BstEII was
cloned into pCR™II (Invitrogen) and the sequence was verified
as described in Step 3.1. The resulting vector is
phZCE/CH1BstEII.

A pair of oligonucleotides, B743 and B744, were
designed to add the splice recognition site and the SfiI site
3' of the variable region. The IM9 heavy chain cDNA vector
was digested with BamHI and HindIII, extracted with a phenol
and chloroform mixture, precipitated with EtOH, and
resuspended in TE. Primers B743 and B744 were kinased,
annealed together, and ligated with the digested vector. The
ligation reaction was used to transform E. coli DH10B by
electroporation. The colonies were picked for analysis by
restriction enzyme mapping and the resulting vector is
pIM9gammacDNASfiI.

The phZCE/CH1BstEII vector and pIM9gammacDNASfiI
were digested with SfiI and BstEII. The 740 bp fragment from
phZCE/CH1BstEII and the 950 bp fragment were purified by
agarose gel electrophoresis. The pGIM9kappa vector was
digested with SfiI and the 12 kb fragment was purified by
agarose gel electrophoresis. The three purified fragments
were ligated and used to transform E. coli DH10B by
electroporation. The colonies were picked for analysis by
restriction enzyme mapping. The resulting vector is
pGIM9k/hZCE-gamma.

The neomycin resistance gene was inserted into the
pGIM9 kappa vector to make pNIM9 kappa. Both the pGIM9kF2 and
the pSV2neo vectors were digested with ApaI and PvuI. The 5 kb
neomycin resistance gene-containing fragment from the pSV2neo
digest and the 9 kb fragment from the pGIM9k digest were
purified by gel electrophoresis. The two fragments were
ligated and used to transform E. coli DH10B by
electroporation. The colonies were analyzed by restriction
enzyme mapping. The resulting plasmid is pNIM9kappa.

Both the pNIM9kappa and the pG(IM9k)/hZCE-gamma
vectors were digested with SfiI. The 9 kb and 5 kb fragments
from pNIM9kappa and the 1.6 kb fragment from
pG(IM9k)/hZCE-gamma were purified by agarose gel
electrophoresis. The three purified fragments were ligated
and used to transform E. coli DH10B. The colonies were
picked and analyzed by restriction enzyme mapping. The
resulting plasmid is pN(IM9k)/hZCE-gamma(cDNA).

In another variation using the variable kappa
region from IM-9 containing ZCE heavy chain CDRs (hZCE(CSVL)
region), a human gamma heavy chain was constructed.

a. Construction of pNIM9k/hZCE(CSVL)-gamma

Using the polymerase chain reaction (PCR), a 2.1
kilobase DNA fragment was amplified using primers B922 (SEQ.
I.D. No. 52) (5'-AAG-AGC-TCC-TGA-ACC-TCG-CGG-ACA-GTT-AA-3')
and B923 (SEQ. I.D. No. 53) (5'-AAA-TCG-ATC-TCA-GGC-CTC-AGA-
CTC-GGC-CTG-ACC-CGT-GGA-AA-3') from the template
pNIM9k/hZCE-gamma. The 5' end of this fragment contained an
SstI restriction site and the 3' end contained a ClaI site.
A second ClaI to SstI (3') fragment of 8.5 kilobases
containing the neomycin gene, the β-lactamase gene and the hZCE-
CSVL variable region gene was ligated together with the PCR
generated 2.1 kilobase fragment. The 10.6 kilobase plasmid
resulting from this ligation was reopened with SstI
restriction endonuclease and ligated together with a 2.2
kilobase SstI fragment from pGIM9kappa containing a portion
of the human kappa major intron with enhancer. The final
expression vector is 12.8 kilobases and called
pNIM9k/hZCE(CSVL)-gamma.

b. Expression of hZCE(CSVL)-intact kappabody.

The pNIM9k/hZCE(CSVL)-gamma expression vector was
electroporated (as described in Example 10.b.) into cells
expressing the pGIM9k/hZCE-kappa gene. Three days following
electroporation the cells were put under drug selection
(geneticin 1.5 mg/ml) and colonies which grew up under this
selection were analyzed for secretion of hZCE(CSVL)-intact
antibody.

Protocol for Subcloning Transfectomas

Individual wells from the initial screening for
cells secreting the highest levels of immunoglobulin were
further subcloned to ensure a single clone had been selected.
Briefly, the cells were diluted to 10, 5 or 0.3 cells per
200 µl and plated into two 96-well tissue culture plates at
each dilution. The medium is HH4 with 10% fetal calf serum,
100 µg/ml xanthine and the appropriate selection drug. After
fourteen days individual wells were visually screened for
single colonies, then harvested and cultured further so as to
obtain a quantitative ELISA value as described in
Example 15, below.
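
The reason for including a dilution below one cell per well can
be illustrated with a simple calculation. The following Python
sketch is an illustration only. It assumes that the number of
cells delivered to a well follows a Poisson distribution and that
every seeded cell grows into a colony; neither assumption is
stated in the protocol above, and the function name is
hypothetical.

```python
import math


def prob_clonal_given_growth(mean_cells_per_well):
    """P(exactly one cell | at least one cell) under a Poisson seeding model."""
    lam = mean_cells_per_well
    p_one = lam * math.exp(-lam)   # exactly one cell in the well
    p_any = 1.0 - math.exp(-lam)   # one or more cells in the well
    return p_one / p_any


# The three seeding densities named in the protocol (cells per 200 µl well).
for lam in (10, 5, 0.3):
    print(lam, round(prob_clonal_given_growth(lam), 4))
```

Under these assumptions, most wells showing growth at 0.3 cells
per well derive from a single cell, whereas growing wells at 5 or
10 cells per well are almost never clonal, which is why visual
screening for single colonies is still needed at the higher
densities.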

Example 14

Cloning and Expression of a Single Chain Fv Containing
a CSVL Fragment

a. Construction of pGIM9k/hZCE(CSVL)-ScFv expression
vector.

To construct an expression vector for a CSVL, the
earlier expression vector pGIM9k/hZCE-kappa was reconstructed
to contain the CDR-grafted kappa variable region in place of
the human kappa constant region. In addition, the vector
contained a 5' extension to the kappa variable region to
serve as a linker (L) between variable regions as well as a
3' chelating peptide (CP) sequence. Therefore,
diagrammatically, the linear construct is as follows:

The kappa variable region with 5' linker and 3'
chelating peptide was synthesized in three separate PCR
reactions. The first DNA fragment (Fragment-1) of 457 base
pairs was amplified from vector pGIM9k/hZCE-kappa
(Example 10.a.) with a 5' primer C101 [SEQ. I.D. No. 54]
5'-CTG TTT GCT TCT TTC CTC AGG AGG CGG TTC AGG AGG ATC AGG
CGG TTC AGG TGG ATC AGG AGG CGA CAT CCA GAT GAC CCA GTC TCC
T-3' containing the MstII restriction site, the linker
(G-G-S)4-GG and the first 24 bases of the constant kappa gene
(Example 10.a.); and a 3' primer C102 [SEQ. I.D. No. 55]
5'-GTC AGG CTG GAA CTG AGG AGC AGG TGG GGG CAC TTC TCC CTC
TAT GGG TGA TGG TGC CAA TGT TTG ATT TCC ACC TTG GTC CCT TGG
CCG AA-3' containing the bases of the 3' end of the kappa
constant region, a H-W-H-H-H-P chelating peptide and stop
codon. The second DNA fragment (Fragment-2) of 335 base
pairs was generated using 5' primer C103 [SEQ. I.D. No. 56]
5'-GAG AAG TGC CCC CAC CTG CTC CTC AGT TCC AGC CTG ACC CCC
TCC CAT CCT-3' and 3' primer B441 [SEQ. I.D. No. 50] and
the same template as above, i.e. pGGhZCE-HB. This fragment
contained the 3' human kappa constant region containing the
polyadenylation signal.

The final DNA fragment (Fragment-3) was amplified
using Fragment-1 and Fragment-2 as template and 5' primer
C101 [SEQ. I.D. No. 54] and 3' primer B441 [SEQ. I.D. No.
50] to give the approximately 800 base pair Fragment-3. This
Fragment-3 was cloned into a TA vector for confirmation of
DNA sequence as described in Example 9.b. Following
confirmation of sequence, the Fragment-3 insert was
re-isolated from the TA vector as an MstII fragment and cloned
into the vector pGIM9k/hZCE-hb (which had its MstII fragment,
containing the human kappa constant region, deleted). All PCR
amplifications were carried out as described in Example 9.a.






b. Expression of hZCE(CSVL)-ScFv.

After cloning and scale up, the final expression
vector, herein called pGhZCE-CSVL-sFV, was electroporated
into SP2/0 hybridoma cells as described in Example 10.b.
Clones secreting the CSVL-sFV construct were identified as
described in Example 15.f., below. Finally, the affinity
of the construct was analyzed via a competitive inhibition
assay as described in Example 15.e., below.

Example 15

Identification, Quantitation and Affinity
Determination of Engineered [illegible]

a. Identification and quantitation of secreted
hZCE-CSVL-kappa homodimer and hZCE(CSVL)/hZCE-
kappa heterodimers

Secreted CDR-grafted human kappa chains from transfected
SP 2/0 cells expressing hZCE kappa homodimer, and from those
expressing hZCE-CSVL homodimer, were identified and quantitated
by a standard enzyme-linked
immunosorbent assay ("ELISA", as described by Engvall, E. and
Perlmann, P., Immunochemistry 8:871-874 (1971)) for human
kappa. The purpose of this assay was to identify those cells
secreting the highest levels of kappa chain polypeptide coded
for by the pGIM9k/hZCE-kappa or pGIM9k/hZCE(CSVL)-kappa plasmid
vector. A 5 µg/ml solution of goat anti-human kappa chain
(Tago #4106, Tago Inc., Burlingame, CA) in 10 mM sodium
phosphate pH 7.4 was prepared. Each well of a 96 well plate
was coated with 50 µl of this solution. The plates were then
incubated overnight at 37°C. Plates were then rinsed
thoroughly in H2O, and then PBS with 1.0% Tween-20™ (w/v).
Fifty µl of the supernatant fractions were added to each
well, and incubated for two hours at room temperature.
Plates were again rinsed as detailed above. A goat anti-
human kappa chain alkaline phosphatase conjugate (Tago #2496,
Tago, Inc.) was diluted 1:1000 in the same medium as the
supernatant material. 100 µl were added per well and allowed
to incubate for one hour at room temperature. Plates were
rinsed as above. The alkaline phosphatase substrate
(Hybritech, Inc., San Diego, CA, Part #100103) was prepared as
per package instruction, one tablet per 3 ml of distilled H2O,
and 150 µl of this substrate was added to each well and
allowed to incubate 30 minutes at 37°C. The reaction was
quenched with 50 µl of 300 mM EDTA and then the absorbance was
read at 405 nm. Colonies whose supernatants showed the
highest levels of kappa expression were subcloned and
cryopreserved. Expression levels are shown in Table 3.

b. Identification and quantitation of hZCE(CSVL)-
intact kappabodies.

Detection of assembled hZCE(CSVL)-intact
kappabodies was carried out by coating the microtiter plate
wells with goat anti-human IgG heavy chain antibody reagent
(Tago #3100, Tago, Inc., 887 Mitten Road, Burlingame, CA) at
5 µg/ml in 10 mM phosphate pH 7 to 8. Plates were dried
overnight at 37°C, then washed with PBS and Tween-20™,
then H2O. Fifty microliters of the cell supernatant were
added to each well and incubated for 2 hours at room
temperature. Plates were again rinsed as detailed above. A
goat anti-human kappa chain alkaline phosphatase conjugate
(Tago #2496, Tago, Inc., 887 Mitten Road, Burlingame, CA) was
diluted 1:1000 in the same medium as the supernatant
material. 100 µl were added per well and allowed to incubate
for 1 hour at room temperature. Plates were rinsed as above.
The alkaline phosphatase substrate (Hybritech) was prepared at
one tablet per 3 ml of distilled H2O, and 150 µl of this
substrate was added to each well and allowed to incubate 30
minutes at 37°C. Purified protein, IgG1 kappa, from the human
lymphoblastoid cell line IM9 was used as a positive control.

c. ELISA for detecting the presence of hZCE(CSVL)
constructs bound to Carcino-Embryonic Antigen
(CEA).

To detect the hZCE(CSVL) constructs which can bind
CEA, ELISAs were performed as follows:

On the first day CEA stock standard (1 mg/ml)
(Hybritech Part #211288) was diluted to 10 µg/ml in PBS with
1 mg/ml BSA in a final volume of 6 ml for each ELISA plate.
96-well ELISA plates (Titertek, McLean, VA) were coated at
50 µl/well, tapped to ensure that all well bottoms were
completely covered, and incubated overnight at 37°C.

On the second day the plates were washed twice with
distilled, deionized water, twice with 1X PBS + 0.1% Tween-20™,
and twice again with distilled, deionized water. Samples
containing the hZCE(CSVL)-heterodimer, hZCE(CSVL)-intact, and
standards were added to the plates at 50 µl/well. Plates were
then sealed and incubated at room temperature for 2 hours.
Goat Anti-(Human Kappa) conjugated with alkaline phosphatase
(Tago #2496, Burlingame, CA) was diluted 1:1000 in RPMI
medium (Gibco) with 10% horse serum and 3% goat serum to a
volume of 10 ml/plate. Plates were washed as before, and the
anti-Kappa conjugate was added to the plates at 50 µl/well.
Then the plates were sealed and incubated at room temperature
on a shaker at ~100 rpm for 1 hour.

PNPP Alkaline Phosphatase tablets (Hybritech Part
#100103) were dissolved in distilled, deionized water at a
ratio of 1 tablet per 3 ml of water, and 150 µl/well of the
alkaline phosphatase solution was added to the plates. The
plates were incubated at 37°C for half an hour and then
absorbances were read at 405 nm using a CERES900 ELISA reader
(BioTek, Inc., Winooski, Vermont). The cultures corresponding
to the wells whose supernatants yielded the highest optical
densities were selected for further scale up and ELISA
quantitation.

d. Competitive inhibition ELISA for quantifying CEA
binding of hZCE-CSVL heterodimers.

The binding affinity of the hZCE(CSVL)-heterodimers
for carcinoembryonic antigen was quantified as follows:

On the first day the substrate antibody was
prepared. Briefly, CEV124.1, a murine monoclonal anti-CEA
antibody obtained from Hybritech (San Diego, CA), was diluted
1:1000 in phosphate buffered saline (PBS) to a final volume
of 6 ml. The PBS was prepared by mixing 1494 g NaCl, 36 g
KCl, 36 g KH2PO4, and QS to 18 L with H2O, then diluted 1:10
with distilled, deionized water. A 96 well plate was coated with
the antibody-containing solution using about 50 µl/well. The
plate was tapped to ensure that each entire well bottom was
covered. The plate was sealed and left at room temperature
overnight. The next day the CEA antigen was prepared as
described in Example 15.c.
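
As a worked illustration of the buffer arithmetic, the following
Python sketch converts the stock recipe stated above (masses made
up to 18 L, then diluted 1:10) into approximate working
concentrations. The molar masses are standard values for the
anhydrous salts; the variable and function names are
hypothetical, and only the three salts named in the text are
included.

```python
# Masses (g) dissolved to STOCK_VOLUME_L, as stated in the text.
STOCK_MASSES_G = {"NaCl": 1494.0, "KCl": 36.0, "KH2PO4": 36.0}
STOCK_VOLUME_L = 18.0
DILUTION_FACTOR = 10.0  # stock is diluted 1:10 before use

# Standard molar masses of the anhydrous salts (g/mol).
MOLAR_MASS = {"NaCl": 58.44, "KCl": 74.55, "KH2PO4": 136.09}


def working_concentration_mM(salt):
    """Millimolar concentration of a salt in the diluted (working) buffer."""
    grams_per_litre = STOCK_MASSES_G[salt] / STOCK_VOLUME_L / DILUTION_FACTOR
    return grams_per_litre / MOLAR_MASS[salt] * 1000.0


working_mM = {salt: working_concentration_mM(salt) for salt in STOCK_MASSES_G}
for salt, conc in working_mM.items():
    print(f"{salt}: {conc:.2f} mM")
```

On these figures the working buffer is roughly 142 mM in NaCl,
which is in the range usually described as physiological saline.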

The plates containing bound antibody were washed
four times with distilled, deionized water, and 50 µl of the
CEA/BSA antigen-containing solution was dispensed into each
well. The plates were sealed and placed on a rotator shaking
at ~300 rpm for 2 hr. Finally the plates were washed as
before.

A supernatant of hZCE(CSVL)-heterodimer was loaded
at 50 µl/well. A standard curve was generated by diluting a
10 µg/ml solution of XCEM F(ab)' or ZCE Fab' at 1:2 increments
along the top row of the assay plate. The XCEM chimeric
antibody was described in Beidler, C.B., et al., "Cloning and
High Level Expression of a Chimeric Antibody with Specificity
for Human Carcinoembryonic Antigen," J. of Immunology
141:4053-4060 (1988). Plates were sealed and incubated on a
rotator as before for 45 minutes to allow the test antibody
to bind to the antigen.
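
The standard curve concentrations implied by a 1:2 dilution
series can be tabulated as follows. This Python sketch is an
illustration only; it assumes that the first well holds the
undiluted 10 µg/ml standard and that the series spans the twelve
wells of one row of a 96-well plate, neither of which is stated
explicitly above. The function name is hypothetical.

```python
def two_fold_series(start_ug_per_ml, n_wells):
    """Concentrations for a 1:2 serial dilution, first well undiluted."""
    return [start_ug_per_ml / 2 ** i for i in range(n_wells)]


# Assumed layout: 12 wells across the top row, starting at 10 µg/ml.
standards = two_fold_series(10.0, 12)
for well, conc in enumerate(standards, start=1):
    print(f"well {well:2d}: {conc:.4f} µg/ml")
```

A twelve-step two-fold series spans about three orders of
magnitude, which gives the curve-fitting software points on both
sides of the inflection of a competitive inhibition curve.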

For use as the competition antibody, biotinylated
F(ab)' fragments of XCEM chimeric monoclonal antibody or ZCE
Fab' were prepared. Biotinylation was conducted as described
by Enzotin Biochem, Inc., New York, NY. The biotinylated
fragments were diluted to a final concentration of 0.4 µg/ml
(experimentally determined to give an OD 490 of about 0.6)
and 50 µl of the solution was added to each well without
washing the plate. The plate was sealed and incubated as
before for 45 minutes and then washed as before.
Streptavidin/horseradish peroxidase (Fisher Biotech,
Pittsburgh, PA) conjugate was prepared as per manufacturer's
directions and then diluted in 1X PBS with 1% BSA. Fifty µl
of the streptavidin labeled conjugate were added to each
well, and the plate was sealed and incubated as before for 45
minutes. Finally the plate was washed as before to remove
the unbound conjugate.

Final substrate was made by completely dissolving
one 10 mg tablet of o-phenylenediamine dihydrochloride (Sigma
#P8287, St. Louis, MO) into 10 ml of PCB (18.45 g citric
acid (monohydrate), 25.86 g Na2HPO4, bring to 1.8 L with MilliQ
H2O, pH to 5.0, QS to 2 L), then adding 15 µl of 30% H2O2.
100 µl of substrate was added to each well. When a standard
curve could be visualized, the plates were quenched by adding
50 µl of 4M H2SO4 to each well. The assay was read on the
BioTek CERES 900 assay reader at an absorbance of 490 nm.
Concentration calculations were done using the built-in
software "Kineticalc Jr." from BioTek Instruments (Winooski,
Vermont). Results of these experiments are shown in Table 3
below.
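
As a rough illustration of how a concentration can be read off a competition standard curve, the sketch below interpolates linearly in log10(concentration) between the two standards that bracket a sample OD. It is not the algorithm used by the reader software named in the text, which is not described there; the function name and the example OD values are hypothetical.

```python
import math


def interpolate_conc(od, standards):
    """Estimate concentration from OD by log-linear interpolation.

    standards: list of (concentration, od) pairs with distinct OD values.
    In a competition assay the OD falls as the competitor concentration rises.
    """
    pts = sorted(standards, key=lambda p: p[1])  # ascending OD
    for (c_a, od_a), (c_b, od_b) in zip(pts, pts[1:]):
        if od_a <= od <= od_b:
            frac = (od - od_a) / (od_b - od_a)
            log_c = math.log10(c_a) + frac * (math.log10(c_b) - math.log10(c_a))
            return 10 ** log_c
    raise ValueError("OD outside the range of the standard curve")


# Hypothetical standards: (ug/ml, OD at 490 nm)
example_standards = [(10.0, 0.1), (5.0, 0.2), (2.5, 0.4)]
print(interpolate_conc(0.3, example_standards))
```

Samples whose OD falls outside the standards raise an error rather than being extrapolated, which mirrors the usual practice of re-assaying at a different dilution.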

e. Competitive inhibition assay for determination of
affinity of anti-CEA antibodies and constructs.

Affinities of unlabeled recombinant antibodies were
determined by a modification of the method described by H.
Motulsky and L. Mahan (Molecular Pharmacology, 21:1-9, 1983).
This method can measure the affinity of unlabeled antibodies
by evaluating their ability to inhibit the binding of a
labeled tracer antibody which reacts with the same epitope of
an antigen.

Tandem®-R CEA Beads (Hybritech #600211), which
contain the mouse anti-CEA antibody CEV124, were put into 13 mm
x 75 mm polystyrene tubes (1 bead per tube) and incubated with
100 mg of CEA, diluted in 1% BSA/PBS solution to a final
volume of 100 µl, for 2-5 hours at room temperature. The
source of the CEA used for these experiments is CEA Stock
Standard Solution (Hybritech, #200288). The beads were then
washed twice with 2 ml of 0.1% Tween20™ in phosphate buffered
saline just prior to adding the antibodies for affinity
testing.

The tracer antibody is an isothiobenzyl-DTPA
conjugate of ZCE025 Fab' fragment labeled with 3 µCi of 111In
citrate per microgram of Fab'. The tracer is first titrated
for binding to the above CEA beads to determine a 40-60%
saturation point. This concentration of tracer (usually 1.5
x 10^-9 M) is used for all the following inhibition reactions.
Varying concentrations of unlabeled XCEM or supernatant
containing hZCE-CSV L heterodimer were added (100 µl) to the
CEA beads at 2X their final concentrations (final is 1 x 10^-7
M down to 1 x 10^-11 M, diluted in 1% BSA/PBS) together with an
equal volume of the 2X tracer (100 µl). The reaction was
then incubated overnight at room temperature on an orbital
shaker (150-200 RPM).
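
To make the arithmetic of a competitive inhibition assay concrete, the sketch below converts an IC50 into an affinity constant using the equilibrium Cheng-Prusoff relation, Ki = IC50 / (1 + [tracer]/Kd). This is a simpler stand-in chosen for illustration; the text cites a modification of the Motulsky and Mahan method, whose details are not given here. The function name, the ten-fold step size of the dilution series, and the example IC50 and tracer Kd values are all assumptions.

```python
def affinity_from_ic50(ic50_molar, tracer_molar, tracer_kd_molar):
    """Cheng-Prusoff estimate: returns Ka = 1/Ki in units of 1/M."""
    ki = ic50_molar / (1.0 + tracer_molar / tracer_kd_molar)
    return 1.0 / ki


# Final competitor concentrations from 1e-7 M down to 1e-11 M
# (ten-fold steps are an assumption; the text gives only the range).
competitor_molar = [10.0 ** -e for e in range(7, 12)]

# Hypothetical inputs: IC50 of 2e-9 M, tracer at 1.5e-9 M, tracer Kd of 1.5e-9 M.
ka = affinity_from_ic50(2e-9, 1.5e-9, 1.5e-9)
print(f"Ka = {ka:.2e} 1/M")
```

The correction term matters because the tracer itself occupies antigen: at a tracer concentration equal to its Kd, the apparent IC50 is twice the true Ki.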

f. Identification and quantification of hZCE(CSV L)
isolate.

Cells putatively secreting hZCE(CSV L)-isolate were
seeded at 4 x 10^5/ml in serum-free HR4 medium containing 100
µg/ml xanthine and 1.0 µg/ml mycophenolic acid. When cell
numbers reached ~1 x 10^6/ml, 1.0 ml of their supernatants
was collected and mixed with 100 µl of Ni+2-loaded
nitrilotriacetic acid agarose beads (Qiagen, Inc., Chatsworth, CA).
The beads and conditioned cell supernatant from 24 individual
clones were incubated for a minimum of four hours on a
rotating wheel at room temperature. The beads were washed 3
times with 50 mM sodium phosphate, 100 mM sodium chloride
buffer, pH 7.4. Bound protein was eluted from the beads by
addition of 100 µl of SDS-PAGE reducing sample buffer. The
eluate was electrophoresed on 15 - 20% SDS-PAGE gels and the
gels were silver stained to visualize and quantitate the
hZCE-CSV L-isolate. SDS-PAGE and silver staining were carried
out using gels, buffers and a silver staining kit from Bio-Rad
(Richmond, CA) according to the manufacturer's instructions.
Results are shown in Table 3 below.

TABLE 3

Construct                        Secreted Protein   Affinity        M.W.
                                 (µg/ml)*           (1/M)

mZCE Fab                         EP                 2 x 10^9        52

hZCE-kappa homodimer,            20, 20, 20         2 x 10^9, ND    50, 54, 52, 160
hZCE(CSV L)-kappa homodimer,
hZCE(CSV L)-kappa heterodimer,
hZCE(CSV L)-intact

hZCE(CSV L)                      1                  ND              18

* = average
EP = enzymatically produced
ND = not determined

The foregoing description of the invention is 
exemplary for purposes of illustration and explanation. It 
should be understood that various modifications can be made 
without departing from the spirit and scope of the invention. 
Accordingly, the following claims are intended to be 
interpreted to embrace all such modifications.

GGSGGSGGSGGSGG (14 RESIDUES) (SEQ I.D. No. 1)
HWHHHP (6 RESIDUES) (SEQ I.D. No. 2)

5' GAC TAG CGG CCG CAT CGA TCC CCC CCC CCC CCC C (SEQ. I.D.
NO. 3)

5' CAG ACG TCG ACG ATG GAT ACA GTT GGT GCA GCA TC (SEQ. I.D.
NO. 4)

SEQ. I.D. NO. 5

ZCE-025 Light Chain Variable cDNA
GAC ATT GTG ATG ACC CAG TCT CAA AAA TTT ATG TCC ACA TCA GTT
GGA GAC AGG GTC AAC ATC ACC TGC AAG GCC ACT CAG AAT GTT CGT
ACT GCT GTA GCC TGG TAT CAA CAG AAA CCA GGG CAG TCT CCT AAA
GCA CTG ATT TAC TTG GCA TCC AAC CGG TAC ACT GGA GTC CCT GAT
CGC TTC ACA GGC ATT GGA TCT GGG ACA GAT TTC ACG CTC ATC ATT
AGC AAT GTG CAA TCT GAA GAC CTG GCA GAT TAT TTC TGT CTG CAA
CAT TGG AAT TAT CCT CTC ACG TTC GGT GCT GGG ACC AAG CTG GAG
CTG AAA C
381

SEQ. I.D. No. 6

DIVMTQSQKFMSTSVGDRVNITCKASQNVRTAVAWYQQKPGQSPKALIYLASNRYTGVPDR
FTGIGSGTDFTLIISNVQSEDLADYFCLQHWNYPLTFGAGTKLELK
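
The correspondence between a cDNA listing and its protein listing can be checked by translating the codons with the standard genetic code. The sketch below is a minimal, dependency-free illustration; the helper name `translate` is hypothetical, and the input is only the first 15 codons of the ZCE-025 light chain cDNA, typed without extraction marks. Codons that are not valid triplets (for example, scanning damage) are reported as "X" rather than guessed.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third base varying fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string (whitespace ignored) into one-letter amino acids."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


first_codons = "GAC ATT GTG ATG ACC CAG TCT CAA AAA TTT ATG TCC ACA TCA GTT"
print(translate(first_codons))
```

Running the same check over a full listing is a quick way to spot where the DNA and protein entries disagree and therefore where a transcription error is likely.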



5' CAG ACG TCG ACG TTC CAG GTC ACT GTC ACT GGC TC (SEQ. I.D.
NO. 7)

SEQ. I.D. NO. 8

ZCE-025 Heavy Chain Variable cDNA Sequence:

GAT GTG CAG CTG GTG GAG TCP GGG GGA GGC TTA GTG CCG CCT GGA
GGG TCC CGG AAA CTC TCC TGT GCA GCC TCT GGA TTC ACT TTC AGT
AAC TTT GGA ATG CAC TGG ATT CGT CAG GCT CCA GAG AAG GGA CTG
GAG TGG GTC GCA TAC ATT AGT GGT GGC AGT AGT ACC GTC CAC TAT
GCA GAC TCC TTG AAG GGC CGA TTC ACC ATC TCC AGA GAC AAT CCC
AAG AAC ACC CTG TTC CTA CAA ATG ACC AGT CTA AGG TCT GAA GAC
ACG GCC ATG TAT TAC TGT GCA AGA GAT TAC TAC GTT AAT AAC TAC
TGG TAC TTC GAT GTC TGG GGC GCA GGG ACC ACG GTC ACC GTC TCC
TCA G
420

SEQ. I.D. NO. 9

DVQLVESGGGLVPPGGSRKLSCAASGFTFSNFGMHWIRQAPEKGLEWVAYISGGSSTVHYA
DSLKGRFTISRDNPKNTLFLQMTSLRSEDTAMYYCARDYYV

SEQ. I.D. NO. 10

GAC ATC CAG ATG ACC CAG TTT CCT TCC ACC CTG TCT GCT TCT GTA
GGA GAC AGA GTC ACC 60

ATC ACT TGT CGG GCC AGT CAG AGT ATT AGT GCC TGG TTG GCC TGG
TAT CAG CAG AAA CCA 120

GGG AAA GCC CCT AAA CTC CTG ATC TAT AAG GCG TCT AGT TTA GAA
AGT GGG GTC CCA TCA 180

AGG TTC AGC GGC AGT GGA TCT GGG ACA GAG TTC ACT CTC ACC ATC
ACC AGC CTG CAG CCT 240

GAT GAT TTT GCA ACT TAT TTC TGC CAA CAC TAT AAT CGA CCG TGG
ACG TTC GGC CAA GGG 300

ACC AAG GTG GAA ATC AAA GCA

IM9 Light Protein SEQ I.D. No. 11
DIQMTQFPSTLSASVGDRVTITCRASQSISAWLAWYQQKPGKAPKLLIY
KASSLESGVPSRFSGSGSGTEFTLTITSLQPDDFATYFCQHYNRPWTFGQGTKVEIK



SEQ. I.D. NO. 12

GAA ATG CAA CTG GTG GAA TTT GGG GGA GGC CTG CTA CAG CCT GGC
AGG GCC CTG AGA CTC 60

TCC TGT GCA GCC TCT GGA TTC AGG TTT GAT GAT TAT GCC ATG CAC
TGG GTC CGG CAA ACT 120

CCA GGG AAG GGC CTG GAG TGG GTC GCA GGT ATT ACT TGG AAT ACT
GAC ACC ATA GAC TAT 180

GCG GAC TCT GTG AAG GGC CGA TTC ACC ATC TCC AGA GAC AAC GCC
AAG AAC TCC CTC TAT 240

TTG CAA ATG AAC ACT CTC AGA GCT GAG GAC ACG GCC TTG TAT TAC
TGT ACA AAA AGA AGG 300

^ ^ GAT ATC TGG CAA GGG ACA
ATG GTC ATC GTC TCT 360

TCA GAG 366



IM9 HEAVY PROTEIN SEQ I.D. No. 13

EMQLVEFGGGLLQPGRALRLSCAASGFRFDDYAMHWVRQTPGKGLEWVAGISWNSDTIDYA
DSVKGRFTISRDNAKNSLYLQMNSLRAEDTALYYCTK^

SEQ I.D. No. 14

DIVMTQSPSSLSVSAGERVTMSCKSSQSLLMSGNQKNPLAWYQQKPGQPPKLLIYGASTRE
SGVPDRFTGSGSGTDFTLTISSVQAEDLAVYYCQNDHSYPLTFGAGTKL

SEQ I.D. No. 15

DWMTQTPLSLPVSLGDQASISCRSSQSLVHSQGNTYLRWYLQKPGQSPKVLIYKVSNRFS
GVPDRFSGSGSGTDFTLKISRVEAEDLGVYFCSQSTHVPWTFGGGTKLE

SEQ I.D. No. 16

DmTQSPAIMSASPGEKVTlfrCSASSS^mWQQKSOTSP^
SGSGSGTSYSLTISSMETEDAAEYYCQQWGRNPTFGGGTKLEIK

SEQ I.D. No. 17

DIQMTQSPASLSASVGETVTITCRASGNIHNY^ 
FSGSGSGTQYSLKINSLQPEDFGSYYCQHFWSTPRTFGGGTKLEIK 

SEQ I.D. No. 18

EIVLTQSPAITAASLGQKVTITCSASSSVSSLHWYQQKSGTSPKPWIYEISKLASGVPARF
SGSGSGTSYSLTINTMEAEDAAIYYCQQWTYPLITFGAGTKLELK

SEQ I.D. No. 19

DIQMTQIPSSLSASLGDRVSISCRASQDINNFLNWYQQKPDGTIKLLIYFTSRSQSGVPSR
FSGSGSGTDYSLTISNLEQEDIATYFCQQGNALPRTFGGGTKLEIK

SEQ I.D. No. 20

SVLTQPPSVSGAPGQRVTISCTGSSSNIGAGNHVKWYQQLPGTAPKLLIFHNNARFSVSKS
GSSATLAITGLQAEDEADYYCQSYDRSLRVFGGGTKLTVL

SEQ I.D. No. 21

QSVLTQPPSASGTPGQRVTISCSGTSSNIGSSTVNWYQQLPGMAPKLLIYRDAMRPSGVP^
RFSGSKSGASASLAIGGLQSEDETDYYCAAWDVSLNAYVFGTGTKVTVL

SEQ I.D. No. 22

EVKLVESGGGLVQPGGSLRLSCATSGFTFSDFYMEW^QPPGKRLEWIA^
YSASVKGRFIVSRDTSQSILYLQMNALRAEDTAIYYCARNYYGSTWY

SEQ I.D. No. 23

EVKLDETGGGLVQPGRPMKLSCVASGFTFSDYWHNV^
YSDSVKGRFTISRDDSKSSVYLQMNNLRVEDMGIYYCTGSYYGMDYWGQGTSVTVSS

SEQ I.D. No. 24

VQLQQSGAELMKPGASVKISCKASGYTFSDYWIElffl^
RFKGKATFTADTSSSTAYMQLNSLTSEDSGVYYCLHGNYDFDGWGQGTTL

SEQ I.D. No. 25

QVQLKESGPGLVAPSQSLSITCTVSGFSLTGYGVNV^
ALKSRLSISKDNSKSQVFLKMNSLHTDDTARYYCARERDYRI^

SEQ I.D. No. 26

EVKLLESGGGLVQPGGSLKLSCAASGFDFSKYWMSWTOQAPGKGLEWIGEIHPDSOT
PSLKDKFIISRDNAKNSLYLQMSQVRSEDTALYYCARLHYYGYNAYWGQGTLVTVSA

SEQ I.D. No. 27

EVQLQQSGVELVRAGSSVKMSCKASGYTFTSNGIN^
EKFKGKTTLTVDKSSSTAYMQLRSLTSEDSAVYTCARSEW

SEQ I.D. No. 28

VKLEQSGPGLVRPSQTLSLTCTVSGTSFDDYYSTWVRQ
UISRVTMLVWTSKNQFSLRLSSVT^^

SEQ I.D. No. 29

EVQLVQSGGGWQPGRSLRLSCSSSGFIPSSYAMYWVRQAPGKGLEWVAIIWDDGSDQHYA
DSVKGRFTISRNDSKNTLFLQMDSLRPEDTGVYFCARDGGHGFCSSASCFGPDYWGQGTPV
TVSS

DIQMTQFPST LSASVGDRVN ITCRASGFTF SNFGMHWIRQ KPGKGLKWVA
YISGGSSTVH YADSLKGRFT ISRDNPKNEL FLTITSLQPD DFAMYYCARD
YYVNNYWYFD VWGQGTKVEI KR (122 residues) (SEQ. I.D. NO.
30)



SEQ. I.D. No. 31 - TAGTGGATCCAACTGATTTCTCCAT
SEQ. I.D. No. 32 - TTATTTACTTCTGGGTCACCAGGTTTATTC
SEQ. I.D. No. 33 - AAGAGGCCGAGCTGGCCCTTCCCTGAATAACCAGGCAGT
SEQ. I.D. No. 34 - GGGAAGGGCCAGCTCGGCGTGTTCCTATAATATGATCAA
SEQ. I.D. No. 35 - TTCCTGGCCCTGCAGGCCCAGTTGTCTGTGTCTTCTGTT
SEQ. I.D. No. 36 - AACTGGGCCTGCAGGGCCAGGAAGCAAAGTTTAAATTCTA
SEQ. I.D. No. 37 - CATGTCTGGATCCAACTGATTT
SEQ. I.D. No. 38 - CTGATTTACTTCTGGGTGACCAGGTTTATTCAA



SEQ. I.D. No. 39

5'-AAGGGCCAGCTCGGCCTCTTCCTATAATATGATCAATAGTATAAATATTTGTGTTTC-
TATTTCCAATCTCAGGTGCCAAATGTGACATCCAGATGACCCA-3'

SEQ. I.D. No. 40

5'-TGGGCCTGCAGGGCCAGGAAGCAAAGTTTAAATTCTACTCACGTTTGATTTCCACCTTGG-
TT-3'

#1 = B695 = 5'-GGG-AAG-GGC-CAG-CTC-GGC-CTC-TTC-CTA-TAA-TAT-
GAT-CAA-TAG-TAT-AAA-TAT-TTG-TGT-TTC-TAT-TTC-CAA-TCT-CAG-GTG-
CCA-AAT-GTG-ACA-TCC-AGA-TGA-CCC-AGT-TTC-CT-3' (SEQ. I.D.
NO. 41)

#2 = B696 = 5'-GCA-TGC-CGA-AGT-TGG-AGA-AGG-TGA-AGC-CGG-AGG-
CGC-GGC-AGG-TGA-TGT-TCA-CGC-GGT-CGC-CCA-CGG-AGG-CGG-ACA-GGG-
TGG-AAG-GAA-ACT-GGG-TCA-TCT-GGA-TGT-3' (SEQ. I.D. NO. 42)

B549 = 5'-GGC-TTC-ACC-TTC-TCC-AAC-TTC-GGC-ATG-CAC-TGG-ATC-
CGC-CAG-AAG-CCC-GGC-AAG-GGC-CTG-AAG-TGG-GTG-GCC-TAC-ATC-TCC-
GGC-GGC-TCC-TCC-ACC-GTG-CAC-TA-3' (SEQ. I.D. NO. 43)

B550 = 5'-GGT-GAT-GGT-CAG-GAA-CAG-CTC-GTT-CTT-GGG-GTT-GTC-
GCG-GGA-GAT-GGT-GAA-GCG-GCC-CTT-CAG-GGA-GTC-GGC-GTA-GTG-CAC-
GGT-GGA-GGA-GCC-GCC-GGA-GAT-GTA-3' (SEQ. I.D. NO. 44)

B697 = 5'-CCC-CAA-GAA-CGA-GCT-GTT-CCT-GAC-CAT-CAC-CTC-CCT-
GCA-GCC-CGA-CGA-CTT-CGC-CAT-GTA-CTA-CTG-CGC-CCG-CGA-CTA-CTA-
CGT-GAA-CAA-CTA-CTG-GTA-CTT-CGA-CGT-GT (SEQ. I.D. NO. 45)

SEQ. I.D. NO. 46 #6 = B698 = 5'-CAC-AGA-CAA-CTG-GGC-CTG-
CAG-GGC-CAG-GAA-GCA-AAG-TTT-AAA-TTC-TAC-TCA-CGT-TTTG-ATC-TCC-
ACC-TTG-GTG-CCC-TGG-CCC-CAC-ACG-TCG-AAG-TAC-CAG-TAG-TT

SEQ. I.D. No. 47 - 5'-GGG-AAG-GGC-CAG-CTC-GGC-CTC-TT-3'

SEQ. I.D. No. 48 - 5'-CAC-AGA-CAA-CTG-GGC-CTG-CA-3'

SEQ. I.D. No. 49 - 5'-CAC-CAT CCT GTT TGC TTC TTT CCT CAG
GAA CTG TGC ACT GGC ACC ACC ACC CAT AGA GGG AGA ACT GCC CCC
ACC TGC TCC TCA GTT -3'

SEQ. I.D. No. 50
5'-GGGTAAAAATAGAATGAAGGATGATTTTTATAAAT-3'

SEQ. I.D. No. 51 GAC ATC CAG ATG ACC CAG TTT CCT TCC ACC
CTG TCC GCC TCC GTG GGC GAC CGC GTG AAC ATC ACC TGC CGC GCC
TCC GGC TCC ACC TTC TCC AAC TTC GGC ATG CAC TGG ATC CGC CAG
AAG CCC GGC AAG GGC CTG AAG TGG GTG GCC TAC ATC TCC GGC GGC
TCC TCC ACC GTG CAC TAC GCC AAC TCC CTG AAG GGC CGC TTC ACC
ATC TCC CGC GAC AAC CCC AAG AAC GAG CTG TTC CTG ACC ATC ACC
TCC CTG CAG CCC GAC GAC TTC GCC ATG TAC TAC TGC GCC CGC GAC
TAC TAC GTG AAC AAC TAC TGG TAC TTC GAC GTG TGG GGC CAA GGG
ACC AAG GTG GAA ATC AAA

-AAG-AGC-TCC-TGA-ACC-TCG-CGG-ACA-GTT-AA-3' SEQ. I.D. No. 52

5'-AAA-TCG-ATC-TCA^^
SEQ. I.D. No. 53

SEQ. I.D. No. 54 5'-CTG TTT GCT TCT TTC CTC AGG AGG CGG
TTC AGG AGG ATC AGG CGG TTC AGG TGG ATC AGG AGG CGA CAT CCA
GAT GAC CCA GTC TCC T-3'

SEQ. I.D. No. 55 - 5'-GTC AGG CTG GAA CTG AGG AGC AGG TGG
GGG CAC TTC TCC CTC TAT GGG TGA TGG TGC CAA TGT TTG ATT TCC
ACC TTG GTC CCT TGG CCG -AA-3'



SEQ. I.D. No. 56 - 5'-GAG AAG TGC CCC CAC CTG CTC CTC AGT
TCC AGC CTG ACC CCC TCC CAT CCT -3'

SEQ I.D. No. 59

GAA ATG CAA CTG GTG GAA TCT GGG GGA GGC CTG CTA CAG CCT GGC
CGG GCC CTG CGG CTC TCC TGT GCA GCC TCT GGA TTC ACT TTT AGT
AAC TTT GGA ATG CAC TGG ATT CGG CAA ACT CCA GGG AAG GGC CTG
GAG TGG GTC GCA TAC ATT AGT GGT GGC AGT AGT ACC GTC CAC TAT
GCA GAC TCC TTG AAG GGC CGA TTC ACC ATC TCC CGG GAC AAC GCC
AAG AAC TCC CTC TAT TTG CAA ATG ACC AGT CTC CGG GCT GAG GAC
ACG GCC TTG TAT TAC TGT GCA CGG GAT TAC TAC GTT AAT AAC TAC
TGG TAC TTC GAT GTC TGG GGC CAA GGG ACA ATG GTC ATC GTC TCT
TCA G

SEQ I.D. No. 60

5'-GAT CCG AAA TGC AAC TGG TGG AAT CTG GGG GAG GCC TGC TAC
AGC CTG GCC GGG CCC TGC GGC TCT CCT GTG CAG CCT CTG GAT TCA
CCT TTA G-3'

SEQ I.D. No. 61

5'-CAC CAC TAA TGT ATG CGA CCC ACT CCA GGC CCT TCC CTG GAG
TTT GCC GAA TCC AGT GCA TTC CAA AGT TAC TAA AGG TGA ATC CAG
AGG C-3'

SEQ I.D. No. 62

5'-GGG TCG CAT ACA TTA GTG GTG GCA GTA GTA CCG TCC ACT ATG
CAG ACT CCT TGA AGG GCC GAT TCA CCA TCT CCC GGG ACA ACG CCA
AGA A-3'

SEQ I.D. No. 63

5'-TAT TAC TGT GCA CGG GAT TAC TAC GTT AAT AAC TAC TGG TAC
TTC GAT GTC TGG GGC CCA GGG ACA ATG GTC ATC GTC TCT TCA -3'

SEQ I.D. No. 64

5'-GTA ATC CCG TGC ACA GTA ATA CAA GGC CGT GTC CTC AGC CCG
GAG ACT GTT CAT TTG CAA ATA GAG GGA GTT CTT GGC GTT GTC CCG
GGA G -3'

SEQ I.D. No. 65
5'-AAG GAT CCG AAA TGC AAC TGG TGG AAT CT -3'
SEQ I.D. No. 66 - GAC GAA TTC TGA AGA GAC GAT GAC CAT TG

We Claim:

1. A recombinant antibody or antigen binding
fragment thereof, comprised of at least one light chain 
variable domain, which domain, in turn, comprises three CDRs 
wherein the amino acid sequence of one or more of the CDRs is 
derived from the amino acid sequence of the corresponding 
CDR(s) of a heavy chain variable domain of one (donor) 
antibody and further comprises four framework regions wherein 
the amino acid sequence of one or more of the framework 
regions is derived from the amino acid sequence of the 
corresponding framework region(s) from the light chain 
variable domain of the same or a different (acceptor) 
antibody. 

2. A recombinant antibody or antigen binding
fragment thereof of claim 1, wherein the antibody or antigen
binding fragment thereof is selected from the group
consisting of:

a) a CSV L fragment;

b) a heavy body [CSV L -- C L];

c) a kappa body fragment [CDR-grafted V L -- C L
|| CSV L -- C L];

d) an intact kappa body {2X [CDR-grafted V L --
C L || CSV L -- C H]}; or

e) an ScFv(CSV L) fragment [either CDR-grafted
V L -- linker -- CSV L or CSV L -- linker --
CDR-grafted V L].

3. A recombinant fragment of claim 2, wherein the
donor and acceptor antibodies are independently chosen from
the group consisting of murine, rabbit, and primate
antibodies.

4. A recombinant antibody fragment of claim 3
wherein the amino acid sequences of all three CDRs of the
CSVL domain are derived from the amino acid sequences of the
corresponding CDRs of the heavy chain variable domain of the
donor antibody and the amino acid sequences of all four
framework regions of the CSVL domain are derived from the
amino acid sequences of the corresponding framework regions
of the light chain variable domain of the acceptor antibody.

5. A recombinant antibody fragment of claim 4,
wherein the acceptor antibody is human.

6. A recombinant antibody fragment of claim 5, 
wherein the human acceptor antibody has light chains of the 
kappa class. 

7. A recombinant antibody fragment of claim 6,
wherein the acceptor antibody is of the IgG class.

8. A recombinant antibody fragment of claim 7,
wherein the donor antibody is murine.

9. A recombinant antibody fragment of claim 8,
wherein the donor murine antibody has affinity for tumor
antigens or antigens on thrombi.

10. A recombinant antibody fragment of claim 9,
wherein the donor murine antibody fragment has affinity for
tumor antigens.

11. A recombinant antibody fragment of claim 10,
wherein the donor murine antibody has affinity for the tumor
markers chosen from the group consisting of AFP, CA-125, CEA,
Neuron Specific Enolase, C-erb2/Her-2/NEU protein, Cathepsin
D, Chromogranins A, B, and C, the Cytokeratins, Epidermal
Growth Factor Receptor, Epithelial Membrane Antigen, Estrogen
Receptor, Progesterone Receptor, Prostatic Acid Phosphatase,
Prostate Specific Antigen, Ki-67, PGP-170 (MDR), PGP-180
(MDR), p120, Proliferating Cell Nuclear Antigen, Vimentin,
and the proteins expressed by the c-myc, N-myc, N-ras, Ki-ras
and Ha-ras oncogenes.

12. A recombinant antibody fragment of claim 11,
wherein the donor murine antibody has affinity for the tumor
antigen CEA.

13. A recombinant antibody fragment of claim 12,
wherein the donor antibody is ZCE 025 or CEM 231.

14. A recombinant antibody fragment of claim 13,
wherein the acceptor antibody is IM9, and the framework
regions are mostly the same as the corresponding IM9
framework regions.

15. A recombinant antibody fragment of claim 14,
wherein the donor antibody is ZCE 025.

16. A recombinant antibody fragment of claim 2,
wherein the C-terminus or N-terminus of the fragment molecule
is fused to a metal chelating peptide.

17. A recombinant antibody fragment of claim 16
wherein the metal chelating peptide has the amino acid
sequence HWHHHP (Sequence I.D. No. 2) and is fused to the
C-terminus of the fragment molecule through the N-terminal
histidine residue of the chelating peptide.

18. A recombinant antibody fragment of claim 4
wherein the fragment is selected from the group consisting of
a) kappabody fragment and b) intact kappabody, the amino acid
sequences of all three CDRs of the CDR-Grafted V L domain are
derived from the amino acid sequences of the corresponding
CDRs of the V L domain of the CDR-Grafted donor antibody, the
amino acid sequences of all four framework regions of the
CDR-Grafted V L domain are derived from the amino acid
sequences of the corresponding framework regions of the V L
domain of the CDR-Grafted acceptor antibody, and the light
chain constant domains are identical in sequence to the
corresponding constant domains of the acceptor antibody or
antibodies.

19. A recombinant antibody fragment of claim 18,
wherein the amino acid sequence of the framework regions of
the CSV L and the CDR-Grafted V L and the C L domains are derived
from the same human acceptor antibody.

20. A recombinant antibody fragment of claim 19,
wherein the framework regions of the CSV L and the CDR-grafted
V L, as well as the complete C L domains are derived from the
corresponding regions and domains of a human acceptor
antibody whose light chain is of the kappa class.

21. A recombinant antibody fragment of claim 20,
wherein the acceptor antibody is of the IgG class.

22. A recombinant antibody fragment of claim 21,
wherein the donor antibody for both the CSV L and the
CDR-grafted V L is the same murine antibody.

23. A recombinant antibody fragment of claim 22, 
wherein the donor murine antibody has affinity for a tumor 
antigen or an antigen on thrombi. 

24. A recombinant antibody fragment of claim 23,
wherein the donor murine antibody has affinity for a tumor
antigen.

25. A recombinant antibody fragment of claim 24,
wherein both donor murine antibodies have affinity for tumor
antigens chosen from the group consisting of AFP, CA-125, CEA,
Neuron Specific Enolase, C-erb2/Her-2/NEU protein, Cathepsin
D, Chromogranins A, B, and C, the Cytokeratins, Epidermal
Growth Factor Receptor, Epithelial Membrane Antigen, Estrogen
Receptor, Progesterone Receptor, Prostatic Acid Phosphatase,
Prostate Specific Antigen, Ki-67, PGP-170 (MDR), PGP-180
(MDR), p120, Proliferating Cell Nuclear Antigen, Vimentin,
and the proteins expressed by the c-myc, N-myc, N-ras, Ki-ras
and Ha-ras oncogenes.

26. A recombinant antibody fragment of claim 25,
wherein the donor murine antibody has affinity for the tumor
antigen CEA.

27. A recombinant antibody fragment of claim 26,
wherein the donor murine antibody is either ZCE 025 or CEM
231.1.

28. A recombinant antibody fragment of claim 27, 
wherein the human acceptor antibody is IM9, and the amino acid 
sequences of both sets of framework regions are derived from 
the amino acid sequences of the corresponding IM9 light chain 
framework regions. 

29. A recombinant antibody fragment of claim 28, 
wherein the murine donor antibody is ZCE 025. 

30. A recombinant antibody fragment of claim 29,
wherein the C-terminus or the N-terminus of either the CSV L-
containing or the CDR-Grafted V L-containing chain of the
fragment molecule is fused to a metal chelating peptide.

31. A recombinant antibody fragment of claim 30,
wherein the metal chelating peptide has the amino acid
sequence HWHHHP and is fused to the C-terminus of the CSV L
containing chain of the fragment molecule through the
N-terminal histidine residue of the chelating peptide.

32. A recombinant antibody or fragment thereof of
claim 2, wherein the antibody or fragment thereof is the
fragment ScFv(CSV L), wherein a CDR-Grafted V L domain is
covalently bonded to a CSV L domain through a polypeptide
linker.

33. A recombinant fragment of claim 32, wherein
the donor and acceptor antibodies are independently chosen
from the group consisting of murine, rabbit, and primate
antibodies.


34. A recombinant antibody fragment of claim 33,
wherein

a) The amino acid sequences of all three CDRs of
the CSV L domain are derived from those of the
corresponding CDRs of the heavy chain variable
domain of the donor antibody used;

b) The amino acid sequences of all four CSV L
framework regions are derived from those of
the corresponding framework regions of the
light chain variable domain of the acceptor
antibody used;

c) The amino acid sequences of all three CDRs of
the CDR-Grafted V L domain are derived from
those of the corresponding CDRs of the V L
domain of the donor antibody used; and

d) The amino acid sequences of all four framework
regions of the CDR-Grafted V L domain are
derived from those of the corresponding
framework regions of the V L domain of the
acceptor antibody.



35. A recombinant antibody fragment of claim 34,
wherein the amino acid sequences of the framework regions of
both the CSV L and the CDR-Grafted V L are derived from those
of the corresponding framework regions of the same human
acceptor antibody.

36. A recombinant antibody fragment of claim 35,
wherein the polypeptide linker is composed of about 12 to
about 18 amino acids.

37. A recombinant antibody fragment of claim 36,
wherein the C-terminus of the CDR-Grafted V L domain is fused
to the N-terminus of the polypeptide linker, and wherein the
C-terminus of the polypeptide linker is bonded to the
N-terminus of the CSV L domain.

38. A recombinant antibody fragment of claim 37, 
wherein the donor antibody is murine. 

39. A recombinant antibody fragment of claim 38, 
wherein the donor murine antibody has affinity for tumor 
antigens or antigens on thrombi. 

40. A recombinant antibody fragment of claim 39, 
wherein the donor murine antibody has affinity for a tumor 
antigen. 

41. A recombinant antibody fragment of claim 40,
wherein the donor murine antibody has affinity for tumor
antigens chosen from the group consisting of AFP, CA-125,
CEA, Neuron Specific Enolase, C-erb2/Her-2/NEU protein,
Cathepsin D, Chromogranins A, B, and C, the Cytokeratins,
Epidermal Growth Factor Receptor, Epithelial Membrane
Antigen, Estrogen Receptor, Progesterone Receptor, Prostatic
Acid Phosphatase, Prostate Specific Antigen, Ki-67, PGP-170
(MDR), PGP-180 (MDR), p120, Proliferating Cell Nuclear
Antigen, Vimentin, and the proteins expressed by the c-myc,
N-myc, N-ras, Ki-ras and Ha-ras oncogenes.

42. A recombinant antibody fragment of claim 41,
wherein the donor murine antibody has affinity for the tumor
antigen CEA.

43. A recombinant antibody fragment of claim 42,
wherein the linker polypeptide is composed of serine and
glycine amino acid residues.



44. A recombinant antibody fragment of claim 43, 
wherein the donor murine antibody is either ZCE 025 or CEM 
231.1. 

45. A recombinant antibody fragment of claim 44, 
wherein the human acceptor antibody is IM9, and both sets of 
framework regions are mostly the same as the corresponding 
IM9 light chain framework regions. 

46. A recombinant antibody fragment of claim 45,
wherein the murine donor antibody is ZCE 025.

47. A recombinant antibody fragment of claim 46,
wherein the linker polypeptide is of the formula
-GGSGGSGGSGGSGG-.

48. A recombinant antibody fragment of claim 47,
wherein the C-terminus or the N-terminus of the ScFv(CSV L) is
fused to a metal chelating peptide.

49. A recombinant antibody fragment of claim 48 
wherein the metal chelating peptide has the amino acid 
sequence HWHHHP and is fused to the C-terminus of the CSV L 
domain of the fragment molecule through the N-terminal 
histidine residue of the chelating peptide. 

50. A DNA or RNA sequence coding for a recombinant
antibody or fragment thereof, wherein the antibody or
fragment thereof is comprised of at least one light chain
variable domain, which domain, in turn, comprises three CDRs
wherein the amino acid sequence of one or more of the CDRs is
derived from the amino acid sequence of the corresponding
CDR(s) of a heavy chain variable domain of one (donor)
antibody and further comprises four framework regions wherein
the amino acid sequence of one or more of the framework
regions is derived from the amino acid sequence of the
corresponding framework region(s) from the light chain
variable domain of the same or a different (acceptor)
antibody.



51. A DNA or RNA sequence coding for a recombinant
antibody or antigen binding fragment thereof of claim 50,
wherein the recombinant antibody or antigen binding fragment
thereof is selected from the group consisting of:

a) a CSV L fragment;

b) a heavy body [CSV L -- C L];

c) a kappa body fragment [CDR-grafted V L -- C L ||
CSV L -- C L];

d) an intact kappa body {2X [CDR-grafted V L -- C L
|| CSV L -- C H]}; or

e) an ScFv(CSV L) fragment [either CDR-grafted
V L -- linker -- CSV L or CSV L -- linker --
CDR-grafted V L].

52. A DNA or RNA sequence of claim 51, wherein in
the recombinant fragment the donor and acceptor antibodies
that are coded for are independently chosen from the group
consisting of murine, rabbit, and primate antibodies.



53. A DNA or RNA sequence of claim 52, wherein the
amino acid sequences of all three CDRs of the CSVL domain
that are coded for are derived from the amino acid sequences
of the corresponding CDRs of the heavy chain variable domain
of the donor antibody and the amino acid sequences of all
four framework regions of the CSVL are derived from the amino
acid sequences of the corresponding framework regions of the
light chain variable domain of the acceptor antibody.

54. A DNA or RNA sequence of claim 53, wherein the
acceptor antibody that is coded for is human.

55. A DNA or RNA sequence of claim 54, wherein the 
human acceptor antibody that is coded for has light chains of 
the kappa class. 



56. A DNA or RNA sequence of claim 55, wherein the
acceptor antibody that is coded for is of the IgG class.

57. A DNA or RNA sequence of claim 56, wherein the
donor antibody that is coded for is murine.

58. A DNA or RNA sequence of claim 57, wherein the
donor murine antibody that is coded for has affinity for
tumor antigens or antigens on thrombi.

59. A DNA or RNA sequence of claim 58, wherein the
donor murine antibody fragment that is coded for has affinity
for tumor antigens.

60. A DNA or RNA sequence of claim 59, wherein the
donor murine antibody that is coded for has affinity for the
tumor markers chosen from the group consisting of AFP,
CA-125, CEA, Neuron Specific Enolase, C-erb2/Her-2/NEU protein,
Cathepsin D, Chromogranins A, B, and C, the Cytokeratins,
Epidermal Growth Factor Receptor, Epithelial Membrane
Antigen, Estrogen Receptor, Progesterone Receptor, Prostatic
Acid Phosphatase, Prostate Specific Antigen, Ki-67, PGP-170
(MDR), PGP-180 (MDR), p120, Proliferating Cell Nuclear
Antigen, Vimentin, and the proteins expressed by the c-myc,
N-myc, N-ras, Ki-ras and Ha-ras oncogenes.

61. A DNA or RNA sequence of claim 60, wherein
the donor murine antibody that is coded for has
affinity for the tumor antigen CEA.

62. A DNA or RNA sequence of claim 61, wherein the
donor antibody that is coded for is ZCE 025 or CEM 231.

63. A DNA or RNA sequence of claim 62, wherein the
acceptor antibody that is coded for is IM9, and the framework
regions are mostly the same as the corresponding IM9
framework regions.

64. A DNA or RNA sequence of claim 63, wherein the
donor antibody that is coded for is ZCE 025.

65. A DNA or RNA sequence of claim 51 wherein the
5'-terminus or 3'-terminus of the DNA or RNA coding for the
fragment molecule is fused to a DNA or RNA sequence,
respectively, coding for a metal chelating peptide.



20 



25 



30 



66. A DNA or rna sequence of claim 65, wherein the 
metal chelating peptide that is coded for has the amino acid 
sequence HWHHHP (sequence i.D.No.) and is fused to the C- 
terminus of the fragment molecule through the N-terminal 
histidine residue of the chelating peptide. 

67. A DNA or RNA sequence of claim 51, wherein the
fragment encoded is selected from the group consisting of a)
kappabody and b) intact kappabody, and the amino acid
sequences of all three CDRs that are coded for of the
CDR-Grafted VL domain are derived from the amino acid sequences
of the corresponding CDRs of the VL domain of the CDR-Grafted
donor antibody, and the amino acid sequences of all four
framework regions that are coded for of the CDR-Grafted VL
domain are derived from the amino acid sequences of the
corresponding framework regions of the VL domain of the
CDR-Grafted acceptor antibody, and the light chain constant
domains that are coded for are identical in sequence to the
corresponding constant domains of the acceptor antibody or
antibodies.

68. A DNA or RNA sequence of claim 67, wherein the
amino acid sequences of the framework regions of both the
CSVL and the CDR-Grafted VL and the amino acid sequence of
the CL domains coded for are derived from
the same human acceptor antibody.

69. A DNA or RNA sequence of claim 68, wherein the
amino acid sequence of the framework regions of the CSVL and
the CDR-Grafted VL and the amino acid sequence of the CL
domains that are coded for are derived from the corresponding
regions and domains of a human acceptor antibody whose light
chain is of the kappa class.

70. A DNA or RNA sequence of claim 69, wherein the 
acceptor antibody that is used is of the IgG class. 

71. A DNA or RNA sequence of claim 70, wherein the 
donor antibody used for both is the same murine antibody. 

72. A DNA or RNA sequence of claim 71, wherein the
donor murine antibody used has affinity for a tumor antigen
or an antigen on thrombi.

73. A DNA or RNA sequence of claim 72, wherein the
donor murine antibody used has affinity for a tumor antigen.

74. A DNA or RNA sequence of claim 73, wherein the
donor murine antibody that is used has affinity for tumor
antigens chosen from the group consisting of AFP, CA-125,
CEA, Neuron Specific Enolase, C-erb2/Her-2/NEU protein,
Cathepsin D, Chromogranins A, B, and C, the Cytokeratins,
Epidermal Growth Factor Receptor, Epithelial Membrane
Antigen, Estrogen Receptor, Progesterone Receptor, Prostatic
Acid Phosphatase, Prostate Specific Antigen, Ki-67, PGP-170
(MDR), PGP-180 (MDR), p120, Proliferating Cell Nuclear
Antigen, Vimentin, and the proteins expressed by the c-myc,
N-myc, N-ras, Ki-ras and Ha-ras oncogenes.

75. A DNA or RNA sequence of claim 74, wherein the
donor murine antibody used has affinity for the tumor antigen
CEA.

76. A DNA or RNA sequence of claim 75, wherein the
donor murine antibody used is either ZCE 025 or CEM 231.1.

77. A DNA or RNA sequence of claim 76, wherein
the human acceptor antibody used is IM9, and both
sets of framework regions are derived from the corresponding
IM9 light chain framework regions.

78. A DNA or RNA sequence of claim 77, wherein the
murine donor antibody used is ZCE 025.

79. A DNA or RNA sequence of claim 78, wherein the
C-terminus or the N-terminus of either the CSVL-containing
or the CDR-Grafted VL-containing chain of the fragment
molecule is fused to a metal chelating peptide.

80. A DNA or RNA sequence of claim 79, wherein the
metal chelating peptide that is coded for has the amino acid
sequence HWHHHP and is fused to the C-terminus of the
CSVL-containing chain of the fragment molecule through the
N-terminal histidine residue of the chelating peptide.

81. A DNA or RNA sequence coding for a recombinant
antibody or fragment thereof of claim 51, wherein the
antibody or fragment thereof that is coded for is the
fragment ScFv(CSVL), wherein a CDR-Grafted VL domain is
covalently bonded to a CSVL domain through a polypeptide
linker.

82. A DNA or RNA sequence of claim 81, wherein the
donor and acceptor antibodies that are coded for are
independently chosen from the group consisting of murine,
rabbit, and primate antibodies.

83. A DNA or RNA sequence of claim 82, wherein

a) The amino acid sequences of all three
CDRs of the CSVL domain that are coded
for are derived from those of the
corresponding CDRs of the heavy chain
variable domain of the donor antibody
used;

b) The amino acid sequences of all four CSVL
framework regions that are coded for are
derived from those of the corresponding
framework regions of the light chain
variable domain of the acceptor antibody
used;

c) The amino acid sequences of all three
CDRs of the CDR-Grafted VL domain that
are coded for are derived from those of
the corresponding CDRs of the VL domain
of the donor antibody used; and

d) The amino acid sequences of all four
framework regions of the CDR-Grafted VL
domain that are coded for are derived
from those of the corresponding framework
regions of the VL domain of the acceptor
antibody used.

84. A DNA or RNA sequence of claim 83, wherein the
amino acid sequences of the framework regions of both the
CSVL and the CDR-Grafted VL are derived from those of the
corresponding framework regions of the same human acceptor
antibody.



85. A DNA or RNA sequence of claim 84, wherein the
polypeptide linker that is coded for is composed of about 12
to about 18 amino acids.

86. A DNA or RNA sequence of claim 85, wherein the
C-terminus of the CDR-Grafted VL domain that is coded for is
fused to the N-terminus of the polypeptide linker, and
wherein the C-terminus of the polypeptide linker that is
coded for is fused to the N-terminus of the CSVL domain.

87. A DNA or RNA sequence of claim 86, wherein the
donor antibody that is used is murine.

88. A DNA or RNA sequence of claim 87, wherein the 
donor murine antibody that is used has affinity for tumor 
antigens or antigens on thrombi. 

89. A DNA or RNA sequence of claim 88, wherein the
donor murine antibody that is used has affinity for a tumor
antigen.

90. A DNA or RNA sequence of claim 89, wherein the
donor murine antibody that is used has affinity for tumor
antigens chosen from the group consisting of AFP, CA-125,
CEA, Neuron Specific Enolase, C-erb2/Her-2/NEU protein,
Cathepsin D, Chromogranins A, B, and C, the Cytokeratins,
Epidermal Growth Factor Receptor, Epithelial Membrane
Antigen, Estrogen Receptor, Progesterone Receptor, Prostatic
Acid Phosphatase, Prostate Specific Antigen, Ki-67, PGP-170
(MDR), PGP-180 (MDR), p120, Proliferating Cell Nuclear
Antigen, Vimentin, and the proteins expressed by the c-myc,
N-myc, N-ras, Ki-ras and Ha-ras oncogenes.

91. A DNA or RNA sequence of claim 90, wherein the 
donor murine antibody used has affinity for the tumor antigen 
CEA. 



92. A DNA or RNA sequence of claim 91, wherein the
linker polypeptide that is coded for is composed of serine
and glycine amino acid residues.






93. A DNA or RNA sequence of claim 92, wherein the 
donor murine antibody used is either ZCE 025 or CEM 231.1. 

94. A DNA or RNA sequence of claim 93, wherein the
human acceptor antibody used is IM9, and the amino acid
sequences of both the CSVL and CDR-Grafted VL framework
regions that are coded for are derived from those of the
corresponding IM9 light chain framework regions.


95. A DNA or RNA sequence of claim 94, wherein the
murine donor antibody used is ZCE 025.

96. A DNA or RNA sequence of claim 95, wherein the
linker polypeptide that is coded for is of the formula
-GGSGGSGGSGGSGG-.



97. A DNA or RNA sequence of claim 96, wherein the
C-terminus or the N-terminus of the ScFv(CSVL) that is coded
for is fused to a metal chelating peptide.

98. A DNA or RNA sequence of claim 97, wherein the
metal chelating peptide that is coded for has the amino acid
sequence HWHHHP and is fused to the C-terminus of the CSVL
domain of the fragment molecule through the N-terminal
histidine residue of the chelating peptide.
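
For illustration only, and not as part of the claims, the domain order recited in claims 85, 86, 92, 96 and 98 can be sketched in Python. Only the linker GGSGGSGGSGGSGG and the chelating peptide HWHHHP are taken from the claims. The function names and the two domain strings are hypothetical placeholders, and reading "about 12 to about 18" as the closed range 12 to 18 is an assumption.

```python
LINKER = "GGSGGSGGSGGSGG"   # linker formula recited in claim 96
CHELATOR = "HWHHHP"         # metal chelating peptide recited in claims 80 and 98


def check_linker(linker: str) -> bool:
    """Check claim 85 (length) and claim 92 (serine/glycine only).

    Assumption: "about 12 to about 18" is read as 12..18 inclusive.
    """
    return 12 <= len(linker) <= 18 and set(linker) <= {"G", "S"}


def assemble_scfv(cdr_grafted_vl: str, csvl: str,
                  linker: str = LINKER, chelator: str = CHELATOR) -> str:
    """Join the parts N- to C-terminus as in claims 86 and 98:
    CDR-Grafted VL, linker, CSVL, then the chelating peptide,
    whose N-terminal histidine follows the CSVL C-terminus.
    """
    if not check_linker(linker):
        raise ValueError("linker violates the length or composition limits")
    return cdr_grafted_vl + linker + csvl + chelator


# Hypothetical placeholder domains; real VL and CSVL sequences are not given here.
construct = assemble_scfv("AAAA", "CCCC")
print(construct)
```

The sketch operates on amino acid strings for readability; the claims themselves are directed to the DNA or RNA sequences coding for such a construct.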